PREFACE


The primary aim of this book is to bring together what the well-educated 
person needs to know about the professional activities of psychologists in busi- 
ness, industry, advertising and marketing, education, clinical practice, law, 
government, and the military services. The book does not presume to give 
advice on how to treat neuroses, bring up children, handle employees, or live 
one’s life. It seeks rather to give a comprehensive view of the work of applied 
psychologists, that is, all psychologists other than those engaged primarily in 
teaching or basic research within an academic setting. Although directed 
principally to the college student, the book is also appropriate for beginning 
graduate students in psychology, as well as for students in schools of business 
and possibly in schools of law and medicine. It should likewise be of special 
interest to personnel workers, advertisers, and businessmen in general who want 
an overview of what psychology has to offer in practical contexts. 

Throughout the book, attention is focused on the specialized research 
methodologies which represent the psychologist’s most distinctive contribution 
to each field. To this end, each area of professional activity is illustrated with 
one or more major research projects that provide the scientific foundation for 
the particular professional applications. What the student can most profitably
learn from science is method, and this is especially true in a young and rapidly 
changing science like psychology. By focusing on method, moreover, the book 
provides the reader with the needed skills for critical evaluation of psychological 
findings, conclusions, and practices encountered in daily life. A special effort 
has been made to alert the reader to the types of methodological loopholes, 
fallacies, and misinterpretations likely to occur in popular writings on applied 
psychology and in the claims of charlatans and inadequately trained practi- 
tioners. 

Another major aim of the book is to present an integrated and comprehensive 
picture of applied psychology. Combining such disparate fields as clinical, 
engineering, and consumer psychology within a single book makes it possible 
to perceive relations among their findings that have previously gone un- 
noticed. Fresh insights often emerge from the juxtaposition of similar problems 
that have been studied in different contexts by investigators with diverse points 
of view. For this reason, cross references have been used liberally throughout 
the book, and every effort has been made to note connections and unify the 
various areas. 

Still another aim has been to give the reader a sense of the historical con- 
tinuity of the field. Although applied psychology has undergone explosive 
development during and since World War II, it did not originate at that time.
A deliberate effort has been made to include outstanding early experiments 
and to give the antecedents of certain characteristic modern developments, 
such as teaching machines, human-factors research in equipment design, or 
the psychosomatic approach to physical disorders. 

Mention should also be made of the way in which references are utilized.
Throughout the book, references are given, not only to identify sources of data
and ideas, but also to suggest further readings for anyone interested in delving 
more deeply into a topic. The extensive bibliographies at the end of each 
chapter have been selected partly with this objective in mind. In the text 
itself, references are cited only by number in order not to clutter the page with 
names and dates of little intrinsic interest to the college student. 

The author is indebted to many colleagues for their contributions to the 
preparation of this book. The following psychologists read one or more chapters 
and made many constructive suggestions: Edwin A. Fleishman, Joseph D. 
Matarazzo, Arthur W. Melton, Charles K. Ramond II, E. J. Shoben, Jr., 
S. D. S. Spragg, J. Wayne Wrightstone, and Joseph Zubin. Among the many 
persons who graciously provided relevant material in the form of unpublished 
reports, reprints, illustrations, or extensive correspondence, special mention 
should be made of the following: Leslie J. Briggs, Brant Clark, Lee J. Cron- 
bach, Theodore W. Forbes, Robert M. Gagné, James J. Gallagher, Mary 
Agnes Gordon, Louis M. Heil, Jane D. Hildreth, Arthur Koponen, Milton 
Levine, Joseph P. Loftus, Jr., Bernard Lubin, Arthur A. Lumsdaine, John M. 
McKee, Mildred B. Mitchel, Norman L. Munn, D. Morgan Neu, Charles K. 
Ramond II, James J. Regan, Horacio J. A. Rimoldi, Sol M. Roshal, Morton 
A. Seidenfeld, Karl U. Smith, Lawrence N. Solomon, Julius Wishner, 
J. Wayne Wrightstone, and Joseph Zubin. 

To my colleagues in the Fordham University Psychology Department I am 
grateful for their continued interest in the progress of the book and their readi- 
ness to discuss questions and provide pertinent materials; special thanks are 
extended to Joseph F. Kubis, William G. Lawlor, S. J., Dorothea McCarthy, 
and Henryk Misiak. I am happy to acknowledge the contribution of my 
husband, John P. Foley, Jr., to the solution of countless problems at all stages 
in the preparation of the book. Finally, I am indebted to Margaret Tighe of 
the Fordham University library staff, for her competence and dedication in 
tracking down elusive references and providing other bibliographic services, 
to Josephine McMahon for her skillful assistance in indexing, and to Kathleen
Gentile, whose handling of typing and proofreading tasks transformed what
might have been routine chores into a responsible and constructive
performance.


Anne Anastasi 
Introduction


What Is 
Applied Psychology? 


Applied psychology is coming to play a vital role in more and more facets of 
modern society. Prior to World War II, the majority of psychologists were 
engaged in university teaching and basic research. Today over half of the 
members of the American Psychological Association are working in business, 
industry, clinics, hospitals, schools, community agencies, government, or the 
military services. It is the object of this book to provide an overall view of 
the types of functions performed by such applied psychologists. Within each 
field, attention will be focused upon what psychologists as such can contribute 
to the solution of problems that are the joint concern of many different special- 
ists. 


FIELDS OF APPLIED PSYCHOLOGY 


The varied and growing activities of psychologists cannot be readily classified 
into neat and exclusive categories. The multiplicity of ways in which psycholo- 
gists perceive themselves is well illustrated by the more than twenty divisions 
of the American Psychological Association (see Appendix A). Some of these, 
like the Divisions of General and of Experimental Psychology and the Divi- 
sion on the Teaching of Psychology, clearly draw most of their members 
from university faculties. Others, like the Divisions of Industrial, Consumer, 
Clinical, Counseling, Consulting, and Military Psychology, the Division of 
Psychologists in Public Service, and the Division of School Psychologists, are 
composed chiefly of applied psychologists. Still other divisions represent inter- 
est areas that cut across teaching or basic research and professional applications. 
The fact that many APA members belong to more than one division is in 
part a reflection of the overlapping and changing roles played by contem- 
porary psychologists. A few years ago a concerted effort was made to simplify 
the divisional structure of the APA. Several classifications were proposed that 
were simpler and logically more elegant than the existing one. But the project 
was abandoned when it was recognized that a set of coordinate, well-balanced, 
and mutually exclusive categories was neither realistic nor practicable. 
Traditionally, the term “applied psychology” has been most closely identi- 
fied with the applications of psychology in business and industry. The Journal 
of Applied Psychology, established in 1917, is still concerned largely with 
industrial and business psychology. Early textbooks, as well as college courses 
entitled “Applied Psychology,” which began to appear at the time of World 
War I, usually covered problems of personnel selection, efficiency of work, and 
advertising. The first two of these areas were included in 1913 in a book on
Psychology and Industrial Efficiency (30)¹ by Hugo Münsterberg, a Harvard
professor who has been called the first all-round applied psychologist in
America. Münsterberg did much to define the field of applied psychology and
to outline the possible uses of psychology in business and industry, as well as 
in other areas. 

Another pioneer in applied psychology was Walter Dill Scott, whose book 
on Psychology of Advertising (37) appeared in 1908. With his appointment 
as professor of applied psychology at Carnegie Institute of Technology in 
1915, he became the first psychologist to hold such a title in an American 
university. Scott initiated research in several areas of personnel selection and 
in 1919 founded The Scott Company, the first consulting firm in this country 
devoted to industrial psychology. The reader interested in historical perspec- 
tives can find a comprehensive list of landmarks of professional psychology, 
from 1860 to 1939, in a handbook edited by Fryer and Henry (16, pp. vii-x). 
These landmarks include antecedent developments in general psychology that 
contributed to the rise of applied psychology, as well as noteworthy events 
within the entire area of applied psychology itself. A very detailed account of 
the rise and growth of industrial psychology in America, with special reference 
to the contributions of the Carnegie Institute group, is given in a 12-volume 
survey by Ferguson (14). 

Within industrial psychology, later developments led to the differentiation 
of three major areas. These areas grew out of three corresponding traditions 
within general psychology. The area of personnel selection evolved from 
psychological testing and differential psychology; that of human engineering, 
from experimental psychology; and that of employee relations, from research 
on personality and social psychology (21). 

Personnel psychology, covered in the first major division of this book
(Part II), includes personnel selection and classification procedures, the closely
related area of training,² and employee relations. Although some psychologists
specialize in one or another of these areas, most psychologists qualified to work
in one of them can and frequently do work in the others. There is considerable
similarity in the interests, training, and experience of personnel psychologists
concerned with any of these areas.

A distinctly different group, however, is engaged in engineering psychology, 
treated in Part III. Such psychologists deal with problems of work and fatigue,
the improvement of work methods, conditions in the working environment— 
such as illumination, ventilation, and noise—and the design of equipment for 
more effective use by the human operator. Some psychologists identify the 
field of engineering psychology wholly with equipment design; others use the 
term more broadly to cover the other areas mentioned. Engineering psychology, 
whether in the broader or more restricted sense, has also been called by several 
other names, including “human engineering,” “applied experimental psychology,”
“biomechanics,” “biotechnology,” “human-factors research,” and “ergonomics.”
The term “engineering psychology,” however, has been most widely
accepted, especially in America. As can be surmised, psychologists in this field
work in close collaboration with engineers.

1 Throughout the book, the numbers in parentheses correspond to the numbers of the
references listed at the end of each chapter.

2 The application of learning principles and experimental techniques to industrial
training has lagged far behind the development of the three major areas cited above,
although many experimental psychologists participated in research on military training
during and after World War II.

In industry, the activities of both personnel and engineering psychologists 
have been traditionally sponsored by management. More recently, increasing 
interest has been evidenced in the possible roles of psychologists in trade 
unions (see, e.g., 12). Working either under joint union-management auspices 
or directly for labor unions, psychologists could conduct investigations in such 
familiar areas as employee attitudes, placement, work methods, working en- 
vironment, and equipment design. They could study internal organizational 
and administrative problems within the unions themselves. And they could 
conduct research on the process of collective bargaining and labor-management 
negotiations. So far only meager beginnings have been made in any of these 
directions. 

Advertising and consumer psychology, discussed in Part IV, also started as 
a part of business and industrial psychology but now represents a distinct 
field. Beginning at the turn of the century in laboratory studies of individual 
features of advertisements, this field has broadened to include extensive re- 
search on consumer characteristics, wants, and preferences. Such research 
merges into studies of economic behavior on the one hand and product design 
on the other. With regard to methodology, consumer surveys also have much 
in common with public opinion polling. Psychological work in these areas has 
many points of contact with economics and sociology. 

In Part V, the reader is introduced to clinical psychology. One chapter con- 
siders techniques for the diagnosis of emotional and intellectual disorders; 
another, therapeutic procedures; and a third, research in clinical psychology. 
Although the first psychological clinic in America was established by Lightner 
Witmer at the University of Pennsylvania as early as 1896, the practice of 
clinical psychology dates largely from World War II. Prior to that time, 
clinical psychologists were chiefly mental testers, and most of their clients were 
children. The years since World War II have witnessed a phenomenal rise 
in the number of psychologists working in the clinical area, as well as an 
expansion of their activities to include more work with adults and more 
psychotherapy at all age levels. Research on clinical problems has also in- 
creased sharply both in quantity and in methodological sophistication. 

That the work of psychologists often defies classification is well illustrated 
by the distinction between clinical psychology and counseling psychology 
(Part VI). In general, the counseling psychologist is concerned with milder 
forms of maladjustment than is the clinical psychologist. He deals, too, with 
a variety of problems encountered by essentially normal persons. Nevertheless, 
the functions of many individual clinical and counseling psychologists un- 
doubtedly overlap. 

Since vocational counseling constitutes a major part of the functions of many 
counseling psychologists, special attention will be given in Part VI to the 
nature of vocational choices, the factors affecting them, and their significance 
in the life of the individual. It is noteworthy, too, that the field of counseling 
psychology, although now much broader, originated in the vocational-guidance 
movement (see 6). The first vocational-guidance bureau was opened in Boston 
in 1908 by Frank Parsons, whose book on Choosing a Vocation (32) was 
published the following year. The National Vocational Guidance Association 
was organized in 1913. And in 1914 the first vocational-guidance unit in a
public school system was established in Cincinnati. In its subsequent growth, 
the vocational-guidance movement itself was greatly influenced by the develop- 
ment of psychological testing. Still later, the emerging and broader field of 
counseling psychology incorporated many concepts and procedures originating 
within clinical psychology. 

Part VII, covering psychology and other professions, is concerned with areas
in which psychologists function chiefly as consultants to other professional per- 
sonnel. These areas include education, medicine, and law. Psychologists also 
contribute indirectly to these fields through the training of professional person- 
nel, particularly in education. It is characteristic of these fields that, although 
potentially they provide many opportunities for the application of psychology, 
their contacts with psychology have remained relatively undeveloped. But this 
situation is changing. Today there are many indications of a growing interest 
in the fuller utilization of psychology in all these fields. 


THE SCIENCE AND PROFESSION OF PSYCHOLOGY 


Applied psychology does not differ in any fundamental way from the rest of 
psychology. In terms of training and orientation, every applied psychologist is 
a psychologist first and an applied specialist secondarily. This point of view 
was reaffirmed in a conference on graduate education in psychology held in 
1958 under the auspices of the American Psychological Association (19). 
Psychology is probably alone in its status of "professional science" or "scien- 
tific profession," and that, the conference concluded, is the way psychology 
should remain if it is to be maximally effective. Psychologists are not sharply 
separated into basic scientists and professional practitioners. All have had 
essentially the same types of courses; specialization occurs chiefly at the level 
of practicum training and on-the-job experience. The clinical, counseling, or 
personnel psychologist, for example, has received training in experimental 
psychology, and the experimental psychologist in his research utilizes concepts 
or hypotheses that may have originated in the clinic or factory. 

The unique contribution that the psychologist can make in business, industry,
government, hospitals, clinics, schools, and other practical settings
stems principally from his research approach to problems of human behavior.
When the psychologist goes to work in an applied context, he brings with him,
not a set of rules or specific facts, but a method for attacking problems. This 
is essentially the scientific method, common to all sciences. The applied 
psychologist has been trained in the many techniques for utilizing this basic 
method in the study of human behavior. It is for this reason that the major 
emphasis in the present book will be placed on the specific methodologies 
appropriate to each field of applied psychology. 

What, specifically, is meant by the statement that the applied psychologist
uses the scientific method? First, like his colleagues in all fields of psychology,
he relies on empirical observation rather than subjective opinion. His choice
of a personnel selection test, training method, or type of therapy is not based
on his “superior wisdom” as a psychologist, nor on the recommendation of
leading authorities, standard texts, or “hundreds of satisfied clients.” Instead,
the psychologist gathers data to evaluate empirically the effectiveness of different
tests, training methods, or forms of therapy. He will, of course, draw upon
his previous knowledge and whatever wisdom he possesses to choose promising 
procedures and to formulate testable hypotheses. His familiarity with psy- 
chological facts and principles and his prior experience in similar practical 
situations will greatly facilitate the selection or development of effective pro- 
cedures. But their effectiveness cannot be assumed—it always needs to be 
demonstrated. 

It is also characteristic of the scientific method that full records of observa- 
tions are made without delay, with the aid of mechanical devices when 
feasible. Even when relying on his unaided observation, however, the psy- 
chologist does not trust to memory for his facts. As a psychologist, he is 
particularly aware of the selectivity of attention and memory. We tend to 
notice and to recall just those observations that support our preconceived 
expectations or biases and to forget contradictory facts. Or we may recall the 
most dramatic instances, which are likely to be atypical and less representative 
than the more readily forgotten ordinary cases. 

Another prerequisite of scientific observation is replication. Observations 
must be repeated. Generalizing from a single observation or from a few cases 
is especially hazardous in psychology because of the extensive individual 
differences in behavior, as well as the variability in the responses of the same 
individual on different occasions. Many repetitions are needed to establish a 
conclusion with adequate certainty. 

But numbers are not enough. Statistics may lie when improperly employed. 
The cases observed must be representative of the groups about which con- 
clusions are to be drawn. The samples utilized must be carefully scrutinized for 
possible selective bias. Perhaps the most vivid demonstration of the effect of 
such selective bias is provided by public opinion polls that "went wrong" in
their predictions because certain income levels or other categories of persons 
happened to be either overrepresented or underrepresented in the samples 
interviewed. 

One of the chief distinguishing features of the scientifically trained applied 
psychologist is that he never fails to specify the margin of error of his pro- 
cedures. No technique is perfect; and the extent of error inherent in each must 
be taken into account in any administrative decisions based upon its applica- 
tion. Such error itself must be empirically determined before the technique is 
ready for operational use. An example of the type of chance errors that affect 
psychological measurement is provided by temporal fluctuation. Thus a job 
applicant might score lower on a test on one day because he is sleepy and 
depressed, and higher on another day when he feels alert and confident. Or 
he might perform better on one form of the test and poorer on another because 
of the particular choice of items in the two forms, thereby illustrating another 
type of measurement error. 

Still another example is to be found in the use of personnel selection tests 
to predict an applicant's job performance. Since so many factors influence an 
employee's achievement on the job, it is obvious that no test could make such a 
prediction with perfect accuracy. Some variation between the predicted and 
actual outcome is inevitable. Through follow-up studies, it is possible to 
determine the magnitude of such prediction errors for any given test and any 
given job. 
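
The magnitude of such prediction errors can be summarized in a single figure. The sketch below is only an illustration, not the procedure of any particular follow-up study: the predicted and actual ratings are invented, and the root-mean-square error is one of several possible summaries.

```python
import math

# Hypothetical follow-up data (invented for illustration): ratings predicted
# from a selection test and ratings later earned on the job.
predicted = [3, 4, 2, 5]
actual = [4, 4, 1, 3]

# Error of prediction for each employee: actual minus predicted.
errors = [a - p for a, p in zip(actual, predicted)]

# Root-mean-square error: the typical size of a prediction error.
rmse = math.sqrt(sum(e * e for e in errors) / len(errors))

print(errors)
print(round(rmse, 2))
```

A smaller figure for one test than for another, on the same job and the same scale, would indicate that the first test predicts that job more accurately.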

A different kind of chance error can be seen in an attitude survey, when one 
sample of 50 employees reports more dissatisfaction with management policies
than another sample of 50. Even though both samples were chosen with equal
care to represent a cross section of the company's employees in all important 
characteristics, the first sample happened to include more disgruntled indi- 
viduals than the second. Any statistical measure, such as an average, per- 
centage, or correlation, should be accompanied by some indication of the 
fluctuation in its value to be expected from sample to sample. With such a 
sampling error, we are able to estimate the limits within which results in other 
samples are likely to fall. Thus we might be able to predict that the percentage 
of dissatisfied employees in the company falls between 24 and 32, even though 
we cannot say definitely whether it is 27, 31, or some other specific figure. 
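
Limits of this kind can be worked out from the sample percentage and the sample size. The sketch below is a minimal illustration under stated assumptions: the sample of 484 employees and the observed 28 per cent are hypothetical figures chosen so that the result matches the 24-to-32 range above, and the 95 per cent level with a normal approximation is an assumption, not something the text specifies.

```python
import math

# Assumed figures, chosen so that the result matches the 24-to-32 range in the
# text: 28 per cent dissatisfied in a hypothetical sample of 484 employees.
p = 0.28
n = 484

# Standard error of a percentage under simple random sampling.
standard_error = math.sqrt(p * (1 - p) / n)

# Approximate 95 per cent limits (normal approximation, 1.96 standard errors).
low = p - 1.96 * standard_error
high = p + 1.96 * standard_error

print(round(100 * low), round(100 * high))
```

A smaller sample with the same observed percentage would give a larger standard error and therefore wider limits.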

Although all of the above procedures are characteristic of the scientific 
method, the very heart of the method is to be found in the design of experi- 
ments that permit observations under controlled conditions. Only under such 
circumstances can we identify cause-and-effect relations. To establish the 
effectiveness of a particular form of therapy, it is not enough to report that 
63 per cent of the patients recovered. We must also know what percentage 
recovered in an untreated control group equated in all relevant respects with 
the treated group. As long as the two groups differed in only one respect, 
namely, the presence or absence of a given treatment, then differences in out- 
come can be attributed to the treatment. 
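
The comparison with a control group can be sketched numerically. In the example below, only the 63 per cent recovery rate comes from the text. The control recovery rate and both group sizes are invented, and the pooled two-proportion z statistic is one conventional way of asking whether a difference exceeds chance, assumed here for illustration.

```python
import math

# Hypothetical counts: the treated figure echoes the 63 per cent in the text;
# the control figure and both group sizes are invented.
treated_recovered, treated_n = 63, 100
control_recovered, control_n = 60, 100

p_treated = treated_recovered / treated_n
p_control = control_recovered / control_n
difference = p_treated - p_control

# Pooled recovery rate and standard error of the difference between two
# independent proportions.
pooled = (treated_recovered + control_recovered) / (treated_n + control_n)
se_diff = math.sqrt(pooled * (1 - pooled) * (1 / treated_n + 1 / control_n))

z = difference / se_diff
print(round(difference, 2), round(z, 2))
```

With these invented numbers the treated group recovers only slightly more often than the untreated group, and the difference is well within what chance alone could produce. The 63 per cent figure by itself would have concealed this.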

It should not be inferred, however, that the scientific method is limited to 
the study of a single variable at a time. With more complex experimental 
designs, it is possible to investigate simultaneously the effects of two or more 
variables, as well as the interaction between them. In Table 1 is a hypothetical 
example of an experimental design in which two methods of teaching arithmetic 
are employed systematically with three groups of children differing in IQ. Of
the 40 subjects in Group I, with IQ's between 120 and 140, 20 are taught by
method A and 20 by method B. The 40 subjects in Group II are similarly
assigned to the two teaching methods, and the same procedure is followed
with Group III. The mean scores obtained by the six subgroups, showing
amount of improvement following the course, indicate that method B is more
effective than method A with the brighter subjects (Group I), while the
reverse is true of the less able groups. If the appropriate statistical analyses
reveal that these differences exceed those expected by chance, we can conclude
that there is a significant interaction between teaching method and
ability level.³ The joint analysis of both variables thus permits the discovery
of relationships that would have gone undetected if either variable had been
investigated separately.

TABLE 1 Schematic Illustration of Interaction
between Teaching Method and Ability Level

Group              Method A      Method B

I.   IQ 120-140    N1 = 20       N2 = 20
                   M1 = 41       M2 = 59

II.  IQ 90-110     N3 = 20       N4 = 20
                   M3 = 38       M4 = 22

III. IQ 60-80      N5 = 20       N6 = 20
                   M5 = 21       M6 = 8
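
The pattern in Table 1 can be made explicit with a few lines of arithmetic. The sketch below uses the six subgroup means from the table, reading the Group III, method A mean as 21. The variable names are illustrative only, and the calculation shows the pattern of the interaction, not its statistical significance.

```python
# Mean improvement scores from Table 1, keyed by group and teaching method.
means = {
    "I": {"A": 41, "B": 59},
    "II": {"A": 38, "B": 22},
    "III": {"A": 21, "B": 8},
}

# Advantage of method B over method A within each ability group.
b_minus_a = {group: m["B"] - m["A"] for group, m in means.items()}

# Overall mean for each method, ignoring ability level (equal subgroup sizes).
overall = {
    method: sum(m[method] for m in means.values()) / len(means)
    for method in ("A", "B")
}

print(b_minus_a)
print(overall)
```

The advantage of method B is positive in Group I and negative in Groups II and III, while the overall means of the two methods differ by only a few points. A study of teaching method alone would therefore have missed the reversal. Deciding whether such differences exceed chance still requires an analysis of variance on the individual scores, which Table 1 does not supply.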

Since applied psychologists follow the same fundamental procedures as 
their laboratory colleagues in solving problems, how can they be differentiated? 
When the applied psychologist is engaged in research—as is true much of the 
time for many applied psychologists—the distinction hinges on the difference 
between basic and applied research. This difference is one of degree. Al- 
though examples can be found that fall clearly into the basic or applied 
categories, some research could be classified in either. One determining factor 
is how the problem is chosen: to help in theory construction or in making 
administrative decisions. For instance, is the investigation concerned primarily 
with the nature of learning or with the most effective method for training 
airplane pilots? 

A closely related difference pertains to the specificity or generality of the 
results. The findings of basic research can usually be generalized more widely 
than those of applied research. The latter characteristically yields results per- 
taining only to narrowly limited situations. Similarly, applied research permits 
less analysis of causal relationships, since situations are likely to be compared 
in their totality, rather than being broken down into more elementary com- 
ponents. For instance, applied research may demonstrate that one of two train- 
ing procedures, in all their operational complexities, yields better results than 
the other within a particular context. But it may not be possible, under these 
circumstances, to specify why the one method was superior. It should be added 
that the sponsorship or setting of a research project does not necessarily indi- 
cate its nature. Some research conducted under industrial or government 
auspices, or in hospitals or clinics, is clearly basic rather than applied. 

Some applied psychologists spend all or most of their time in service rather 
than research functions. How does such a psychologist use scientific method 
when engaged, for example, in psychotherapy with a neurotic patient, in test- 
ing a child with a reading disability, in interviewing an applicant for an 
executive position, or in planning a consumer survey for a toothpaste manu- 
facturer? He does so in at least two ways. First, he evaluates and chooses his 
procedures, not on the basis of subjective opinion, but in the light of available 
published research conducted in comparable situations. Being unable under 
the circumstances to carry out his own empirical evaluations, he nevertheless 
is guided by whatever empirical data are available regarding tests, question- 
naire techniques, types of psychotherapy, or any other procedure he plans to 
use. 

In a still more general way, the psychologist's scientific training influences 
his approach to every problem he encounters in his service functions. Through 
such training, he has formed habits of accurate and objective observation and 
recording of facts and has learned to guard against preconceived notions and 
premature acceptance of conclusions. He has learned to reserve judgment and 
to modify or discard conclusions as fresh facts become available. In his service
functions as in his research, the applied psychologist often needs to formulate
hypotheses and verify them against subsequent data. In trying to discover the
sources of a patient's neurotic fear, the causes of a child's reading difficulty,
or the leadership potential of a job applicant, the psychologist forms a succession
of tentative hypotheses. Each hypothesis suggests what further facts must
be elicited and is in turn either confirmed or disconfirmed by such facts.⁴

3 The statistical procedure required for this purpose is analysis of variance.
Introductions to it can be found in any recent text on statistics, such as Edwards (10),
Ferguson (13), Garrett (17), Guilford (20), and McNemar (28). More detailed
treatment of experimental design is provided by Edwards (9) and Ray (34).

To be sure, not all applied psychologists have profited so fully from their
scientific training. But it is the goal of such training to foster this type of
problem-solving attitude. Such an attitude represents an important difference
between the professional psychologist and either the well-meaning but untrained
"psychologizer" or the deliberate impostor.


THE PROBLEM OF CHARLATANISM 


Some of the areas in which applied psychologists work have traditionally 
proved attractive to charlatans, pseudoscientists, and self-styled experts. In 
this flourishing psychological underworld are found phrenologists, physiog- 
nomists, and others who try to diagnose your aptitude and personality traits or 
prescribe a specific career by examining the shape of your skull, the angle of 
your jaw, or the color of your eyes. Here, too, are the many systems for improving
your personality, developing your memory or your selling ability,
overcoming your fears and other emotional problems, and generally attaining 
health-wealth-and-happiness for the payment of a small fee (see 40). On the 
borderland are the practitioners who assume a semblance of scientific re- 
spectability by administering tests. They may, however, rely on a single test of 
unknown validity for which extravagant claims are made. Or they may send a 
batch of tests by mail, which are taken under uncontrolled conditions and 
interpreted without any other knowledge about the individual. All these 
practices represent flagrant misuses of tests and yield meaningless results. 

Charlatanism is against the public interest under whatever guise it is prac- 
ticed. But charlatans have often called themselves psychologists, thus further 
confounding the public and undermining its confidence in psychology. When 
the glowing promises go unfulfilled and the anticipated benefits fail to 
materialize, psychology gets the blame. Some charlatans hold "degrees" from 
diploma mills and dubious correspondence schools offering bizarre or token 
curricula. Several of these schools grant a degree called "Doctor of Psychology," 
which to the unsuspecting public may sound even more convincing than the 
"Doctor of Philosophy in Psychology" held by genuine psychologists. The 
Ph.D. title itself is used by a still larger number of these questionable institu- 
tions (7, 25), thus increasing the confusion with the legitimate Ph.D. degree 
granted by accredited universities. 

The prevalence of charlatanism in psychology arises partly from the wide- 
spread misconceptions and lack of knowledge about the nature of psychology 
itself (see, e.g., 39). Psychology is one of the youngest of the sciences. Modern 
experimental psychology dates from about 1875; and until quite recently few 
psychologists worked outside of universities. Consequently, the general public 
is still vague about the functions of psychologists and is thus more receptive 
to the claims of the charlatans. 

4 For a provocative discussion of the relationship between the psychologist's service and 
research roles, see Rychlak (36). 

Even more important in accounting for the spread of charlatanism is 
people's strong desire for the sort of help the charlatans offer them. People 
believe the claims—however fantastic—because they want to believe them. 
And the charlatan, unhampered by facts, promises easier and more satisfying 
solutions to life's problems than does the reputable psychologist. The charlatan 
offers shortcuts to self-evaluation, quick cures for your neuroses, a new 
personality in ten easy lessons, and decisive vocational advice in a neat package 
—all with the bland assurance of 99% per cent accuracy. The hard realities 
of science look unattractive against such prospects. 

Still another reason for the spread of pseudopsychology is to be found in 
the lack of adequate legal controls for the title of psychologist. This condition, 
however, is rapidly being remedied. Since World War II, nearly a half of 
the states have enacted certification or licensing laws for psychologists, and the 
number is steadily increasing (see 24 and Appendix B). 

Among their clients, charlatans include not only individuals seeking per- 
sonal help but business organizations as well. In fact, certain consulting firms 
using highly questionable procedures specialize in personnel selection for 
business and industry. Even the more obvious forms of quackery, such as 
phrenology and physiognomy, have not been without adherents among business 
executives (e.g., 5). Such gullibility is not so prevalent today as it was early 
in the century, when psychology was largely unknown to the layman. But it 
is still encountered. As recently as 1953, for instance, Fortune, a magazine 
read widely by businessmen, published a serious and sympathetic account of 
a personnel selection system based on facial characteristics (41). The article 
refers to the use of this system by a number of high-level business executives 
as an aid in personnel decisions. The personnel and industrial-relations director 
of one large manufacturing company, for instance, is quoted as saying that 
this system “is one of the greatest contributions ever made to the field of 
executive selection and development” (41, p. 146). 

Efforts to judge personality and aptitudes from physical appearance date 
back to antiquity. Today this accumulated folklore survives, not only in the 
organized systems of charlatanry, but also in common social stereotypes. 
Literature and art have done much to perpetuate these stereotypes. And our 
very language makes it difficult to escape them. Thus we associate “highbrow” 
or “egghead” with the intellectual, red hair with a fiery temper, and a square 
jaw with firmness of character. These familiar examples illustrate one source 
of popular stereotypes, namely, analogy and superficial resemblance. Red is 
the color of fire, and a square jaw looks more solid and firm than a receding 
chin. Similarly, the association between head size or shape and intelligence is 
bolstered by the vague knowledge that the brain plays an important part in 
behavior. 

A little knowledge about the brain was also the basis for the doctrine of 
phrenology, formulated by the eighteenth- and early nineteenth-century 
anatomist Franz Gall. Going far beyond the meager data then available on the 
localization of cortical functions, Gall proposed that different areas of the 
brain control specific and complex traits, such as mechanical ingenuity, con- 
scientiousness, and combativeness. He maintained further that over- or under- 
development of these traits could be diagnosed by examining the protuber- 
ances on the skull. The identification of a particular "bump" was taken to 
mean that the function allegedly controlled by the corresponding cortical area 
was highly developed in the individual. In the hands of its later exponents, 
Gall's theoretical formulation degenerated into a popular system for quick 
personality assessment and vocational guidance (15). 

An examination of the rationale underlying phrenology will serve to illustrate 
the sort of fallacious reasoning prevalent in such pseudoscientific systems. 
First, the assumed correspondence between skull shape and brain shape ignores 
the cerebrospinal fluid and the several layers of membrane that intervene 
between brain and skull. Second, size is not a dependable index of the degree 
of development of different parts of the brain. Efficiency of function is more 
likely to be related to complexity of interrelations among neurones and other 
microscopical characteristics of nerve matter. Third, the type of trait that 
phrenologists ascribe to different brain areas is quite unlike the functions 
identified through research on cortical localization. Connections have been 
demonstrated between certain muscle groups or sense organs and specific 
brain areas, the very areas where phrenologists localize such traits as veneration 
and parental love. 

Fourth, and probably most important, is the nature of the evidence cited 
by phrenologists in support of their doctrine (15). Such evidence vividly 
illustrates the popular fallacy of citing selected cases. In their published 
writings, phrenologists amassed examples of eminent men whose cranial 
protuberances corresponded to the alleged cerebral location of their special 
talents. Similarly, they cited cases of intellectual leaders who had unusually 
large skulls. A favorite example is Daniel Webster, whose massive skull 
measured over 24% inches in circumference and who had a cranial protuber- 
ance in the exact spot where phrenologists locate literary talent. But of course 
the phrenologists ignore all the other eminent men with average or small skulls 
and with bumps in the wrong places. Phrenologists also refer to microcephalic 
idiots, who have abnormally small skulls and very low intelligence. But they 
overlook the other clinical varieties of mental deficiency with normal skulls 
or with characteristically large skulls. Investigations conducted by psychologists 
on large, unselected samples of children and adults have revealed very low 
correlations between cranial capacity and intelligence (4, ch. 5; 33, ch. 3). 
Similarly, precise measurements of head shape have shown no consistent re- 
lationships with either general intellectual level or special aptitudes (4, ch. 5; 
22, ch. 4; 33, ch. 3). 
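
The contrast between citing selected cases and examining an unselected sample can be made concrete with a small computation. The sketch below is illustrative only: the head circumferences and test scores are invented figures, not data from the studies cited, and `pearson_r` is a hypothetical helper name. It suggests how a sample with no correlation at all can still contain a striking "confirming" case for anyone who picks examples by hand.

```python
from math import sqrt

def pearson_r(xs, ys):
    """Pearson product-moment correlation between two equal-length lists."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    var_x = sum((x - mean_x) ** 2 for x in xs)
    var_y = sum((y - mean_y) ** 2 for y in ys)
    return cov / sqrt(var_x * var_y)

# Invented sample: (head circumference in inches, test score).
sample = [(21.0, 100), (21.0, 130), (22.0, 85), (22.0, 115),
          (23.0, 90), (23.0, 110), (24.0, 95), (24.0, 135)]

heads = [head for head, _ in sample]
scores = [score for _, score in sample]

# Correlation over the whole, unselected sample.
r = pearson_r(heads, scores)

# The "selected case" approach: cite only large heads with high scores ...
confirming = [(h, s) for h, s in sample if h >= 24 and s >= 130]
# ... and pass over small heads with equally high scores.
ignored = [(h, s) for h, s in sample if h <= 21 and s >= 130]

print(f"r over all cases: {r:.2f}")
print("cited as proof:", confirming)
print("left unmentioned:", ignored)
```

In this invented sample the correlation is zero by construction, yet one case fits the phrenologist's claim perfectly. Only the computation over all cases shows that the case proves nothing.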

Closely related to phrenology is the doctrine of physiognomy, under which 
are loosely grouped a number of characteristics pertaining to facial structure, 
hand shape, skin texture, and hair and eye color. Drawing partly from ancient 
lore and partly from selected examples of the physiognomies of famous men, 
Johann Lavater wrote extensively on this system in the eighteenth century. 
Early in the present century, Katherine Blackford (5) elaborated certain 
aspects of it with special reference to its use in personnel selection. In this 
form, the system enjoyed considerable vogue for a number of years. 

In this area, too, psychologists have conducted several investigations designed 
to check on various alleged associations between physical and psychological 
traits (22, ch. 4; 33, ch. 7). Carefully measured facial dimensions and indices 
were correlated with aptitudes and personality characteristics as determined 
by tests or associates' ratings. Among the physical traits investigated were 
convexity of profile, height of forehead, blondness, and a variety of other 
facial measures chosen to test traditional claims of physiognomists. All the 
relationships investigated yielded insignificant correlations. It might be added 
that when psychologists invited practicing physiognomists or phrenologists to 
participate in controlled investigations, they repeatedly met with refusal (see, 
e.g., 22, p. 121). 


WHO IS AN APPLIED PSYCHOLOGIST? 


The prevalence of charlatanism provides one reason for inquiring into ways 
of identifying genuine psychologists. In addition, the phenomenal growth of 
applied psychology since World War II means that more and more psycholo- 
gists are working with laymen or with members of other professions who may 
lack the technical knowledge to evaluate the psychologist's qualifications. 
Hence it has become increasingly important to set up standard procedures for 
identifying psychologists and judging their special competencies. 

Typically, the psychologist holds the Ph.D. degree in psychology from the 
graduate school of an accredited university. He should not be confused with 
a psychiatrist or psychoanalyst. The psychiatrist is trained in medicine. After 
receiving the M.D. degree, he undergoes several years of supervised experi- 
ence in the diagnosis and treatment of mental disorders. His preparation thus 
parallels that of other medical specialists, such as the surgeon or pediatrician. 
The psychoanalyst may be either a psychiatrist or a psychologist who utilizes 
a particular approach to the treatment of emotional disorders, an approach 
originating in the psychoanalytic theories of Sigmund Freud. The clinical 
psychologist, like psychologists in other fields, has earned a Ph.D. degree in 
psychology, with additional experience in working with maladjusted persons. 
More will be said in Part V of this book regarding the functions of clinical 
psychologists and their relations with psychiatry. 

Some psychologists in all fields have only the M.A. degree. But the pro- 
portion of psychologists at this training level is decreasing as standards im- 
prove and become more formalized. Persons with the M.A. degree in psychol- 
ogy are often qualified to function as psychological technicians, in a limited 
area or under supervision, rather than as professional psychologists (19). 

A significant step, both in raising professional standards and in helping the 
general public to determine who is a psychologist, has been the enactment of 
licensing and certification laws for psychologists. Before World War II, there 
was virtually no legal control over either the practice of psychology or the use 
of the title "Psychologist." Today, most of the states in which large numbers 
of psychologists are employed either have such laws or are in the process of 
enacting them (see 24). In such states, psychologists may be identified by 
the possession of a certificate indicating that certain standards have been met. 
In New York State, for example, the requirements include the Ph.D. degree 
in psychology from an accredited university, two years of relevant experience, 
and the passing of a written qualifying examination. The law is administered 
by the State Education Department, with the assistance of a committee of 
psychologists. 

At a still higher level, specialty certification is handled by the American 
Board of Examiners in Professional Psychology (ABEPP), established in 1946. 
Requiring the Ph.D. degree, five years of experience, and intensive written 
and practical examinations, this board issues diplomas in three specialties: 
clinical, counseling, and industrial psychology. ABEPP publishes a directory 
of diplomates in each specialty, which is available in libraries and may also 
be obtained by interested individuals or organizations (see Appendix B). The 
function of ABEPP is essentially to provide information on qualified psy-
chologists. As a privately constituted board within the profession, it has no 
legal authority for enforcement. The various state laws pertaining to psy- 
chology, on the other hand, do carry such authority. In a state having such 
laws, a person without the appropriate certificate who holds himself forth as 
a psychologist is subject to legal penalties. 

ABEPP was originally established by the American Psychological Associa-
tion (APA), the national association to which most psychologists belong (see 
Appendix B). Membership in this association does not imply that the psy- 
chologist is certified for the performance of any service functions—only state 
certificates and ABEPP diplomas serve that purpose. Nevertheless, the annual 
directory of the APA is a source of considerable information about the loca- 
tion, training, and professional functions of its members. The APA has also 
undertaken many other activities designed to improve and systematize train- 
ing standards, raise the level of professional practice, work out effective rela- 
tions with other professions, and clarify the public image of psychology. An 
example is the participation of an APA committee in the preparation of a 
booklet, Psychologists in Action (31), designed to introduce the layman to 
the work of psychologists. Another booklet prepared under APA auspices is 
The Psychologist in Industry (2), dealing with one major area of applied 
psychology. 

Of particular importance was the formulation of a professional code of 
ethics, Ethical Standards of Psychologists (11). First published by the Ameri- 
can Psychological Association in 1953 and revised in 1963, this code is espe- 
cially relevant to the problems encountered by applied psychologists in their 
relations with the public. The abridged, 1963 revision is reproduced in Ap-
pendix C. 

A final question that may be asked about applied psychologists is where 
they work. Through what channels do they bring their specialized techniques 
to the public? Many industrial psychologists operate through consulting firms 
staffed largely by psychologists. Most of these organizations were established 
after World War II, often by groups o 
in psychological work in the Armed Forces. Some consulting firms are broader 
in scope, having been formed by business consultants or engineers but em-
ploying a number of full-time psychologists. All perform services for industry 


United States were employed by consulting firms (27, p. 707). 

Other psychologists are employed directly by business and industry. They 
can now be found in practically every type o A 5 
turing, retailing, transportation, advertising, insurance, and many others. An 
example of an industry-wide cooperative organization that has been using 
the services of psychologists for many years is the Life Insurance Agency 
Management Association (LIAMA). With a psychologi 


director, this association develo 
for life insurance salesmen, conducts employee attitude surveys, and performs 
other personnel research functions. 

Many clinical or counseling psychologists are employed by hospitals, clinics, 
and counseling bureaus. State institutions for mental defectives or for emo- 
tionally disturbed patients, Veterans Administration hospitals, out-patient 
clinics, and community counseling centers are examples of such organizations. 
A much smaller number of psychologists work in prisons or in schools for 
juvenile delinquents (8, 38). In most of these correctional institutions, psy- 
chological services are hampered by serious understaffing. A recently defined 
and growing specialty area is that of the school psychologist, employed by 
public school systems and by some private schools. In addition, a considerable 
number of counseling psychologists and some clinical psychologists work in 
school systems. 

Full-time private practice claims a small proportion of psychologists. In 
a 1960 survey (3), slightly less than 10 per cent of all psychologists were 
classified as self-employed. Among the clinical psychologists, approximately 
14 per cent were in private practice. A one-third random sample of the 
Division of Clinical Psychology of the American Psychological Association, 
also surveyed in 1960, yielded a somewhat larger percentage (26). In this 
group, 17 per cent reported that they were engaged in full-time private 
practice, and an additional 12 per cent that they had part-time private prac- 
tices. Because the private practitioner works without the systematic and 
close contacts with other professional personnel that are provided in an in- 
stitutional setting, he should meet especially high standards of qualification 
(see 1). Preferably, he should hold an ABEPP diploma in his specialty. 

A thriving area of applied psychology is represented by military psychol- 
ogy. In 1957, it was estimated that about 5 per cent of all APA members 
were working for the Army, Navy, or Air Force as full-time military or 
civilian personnel (29). The functions of these psychologists resemble most 
closely those of industrial psychologists. They include chiefly personnel se- 
lection and classification, training, military management and morale, and 
human engineering (18, 29). Some basic research is conducted by military 
psychologists, especially in connection with human engineering problems. 
Certain units in the military services also include clinical psychologists. 

A particularly large group of psychologists is employed by civilian govern- 
ment agencies, such as the United States Civil Service Commission, United 
States Employment Service, United States Public Health Service, Depart- 
ment of Health, Education, and Welfare, Department of Agriculture, and 
Veterans Administration. Beginning in 1922, when the first government 
psychologist, L. J. O'Rourke, was appointed director of research in the 
United States Civil Service Commission (23), the employment of psycholo- 
gists in public service has shown prodigious growth. In a spot check con- 
ducted in 1955, it was estimated that approximately 33 per cent of all psy- 
chologists in the APA were employed by government agencies (35). This 
estimate includes several previously mentioned categories, such as military 
services, state hospitals, and public school systems.⁵ Within the total govern- 
ment service group, about 14.5 per cent were employed by the Federal 
government, 11.4 per cent by cities and counties, and 7.4 per cent by state 
governments. 

5 Not included, however, are psychologists teaching in state universities or city col- 
leges. Were this group to be added, the total number of APA members employed by 
the government would rise to 48 per cent (35). 

Finally, mention should be made of the place of applied psychology in 
universities. Most universities have counseling centers where counseling and 
sometimes clinical psychologists provide service for the university's own 
students. Where an active clinical training program is under way, the uni- 
versity may also have hospital facilities, clinics, and counseling centers that 
serve the public. Some universities have bureaus of industrial psychology 
which operate much like a consulting firm, carrying out contract research 
and development programs for industry or government. To this should be 
added the contract research conducted by individual faculty members, usu- 
ally under government sponsorship. Although many of these individual re- 
search grants are for basic research, some clearly belong in applied psychology. 


SUMMARY 


Before World War II, psychology was primarily an academic discipline. 
Today, over half of America’s psychologists work outside of universities. 
Employed largely by government agencies, industry, school systems, hospitals, 
clinics, and counseling centers, these applied psychologists are contributing 
in such areas as personnel selection and training, employee relations and 
management practices, the improvement of work methods and of the work- 
ing environment, equipment design, advertising and consumer research, the 
diagnosis and treatment of personality disorders, and vocational and other 
types of personal counseling. Some psychologists work with members of 
other professions, as consultants or in the training of other professional 
personnel. This relationship is currently illustrated in the fields of education, 
law, and medicine. 

Like all psychologists, the applied psychologist is trained in the basic sci- 
ence of psychology. His unique contribution stems from his application of 
the scientific method to problems of human behavior. The experimental 
method common to all the sciences consists essentially in the empirical observa- 


applied psychologist is influenced in many ways by his training in scientific 
method. In this regard he differs from the charlatan, who has traditionally 
invaded several areas of applied psychology. Such professional developments 
as state certification of psychologists, the establishment of the American Board 
of Examiners in Professional Psychology, and the formulation of Ethical 
Standards of Psychologists represent significant steps in combating charlatan- 
ism and in clarifying the public image of psychology. 

Concerned with the general area of personnel psychology, Part II of this 
book deals with the application of psychology to the selection and training 
of personnel, as well as to problems of employee relations and management 
practices. The first three chapters of this part are devoted to personnel selec- 
tion, covering in turn: (1) basic problems and procedures in developing a 
selection program; (2) the place of psychological tests in selection; and 
(3) selection procedures based on personal interactions and biographical data. 
In industry, the techniques of personnel selection are used not only in the 
hiring of new employees but also in connection with promotions, discharges, 
transfers, and other personnel decisions. 

Nor are such techniques limited to industrial psychology. Essentially the 
same procedures are applicable to the selection and classification of military 
personnel, the admission of students to colleges and professional schools, or 
the identification of patients most likely to benefit from a particular form 
of therapy. Although first discussed in this section within the context of 
industrial psychology, these techniques provide an introduction to methodol- 
ogy that is of basic importance in most fields of applied psychology. For this 
reason, they will be examined more fully than more specialized procedures 
of limited application. 


JOB ANALYSIS 


The development of a personnel selection program illustrates the basic 
problem-solving approach followed by the applied psychologist. Essentially, 
this approach includes four steps: (1) gathering information to define the 
problem; (2) formulating hypotheses; (3) testing hypotheses; and (4) trans- 
lating the findings of the preceding step into administrative procedures. In 
the development of personnel selection techniques, definition of the problem 
is achieved through a job analysis. The purpose of such a job analysis is to 
provide a description of what workers do on the job for which selection 
procedures are being developed. The description should cover operations per- 
formed, equipment used, conditions of work, hazards and other special char- 
acteristics of the job, rate of pay, nature and amount of training required, 
opportunities for promotion or transfer, relation to other jobs, and any other 
pertinent information. The information included varies somewhat with the 
job, the company, and the use that is to be made of the job analysis. In se- 
lection research, the job description provides the basis for the job specifica- 
tion, or list of worker qualifications for which tests or other selection devices 
must be prepared. An example of a job description prepared by the United 
States Employment Service for a relatively simple job, that of hand dry 
cleaner, is given in Appendix D. 

To be effective, a job analysis must be specific to the job being described. 
A description in terms of vague generalities that would be equally applicable 
to many jobs is of little use for purposes of personnel selection. Viteles (49), 
a pioneer in the development of job analysis techniques, highlighted this 
point by citing the following description of a librarian: “A love of reading 
is not a major qualification since a librarian has little time for reading. She 
needs accuracy, quickness, neatness, a pleasing appearance and the applica- 
tion necessary to do much routine work. All these qualities must be founded 
upon a sincere and active enthusiasm for library work” (49, p. 76). Com- 
menting upon this superficial description, Viteles noted that it would be 
just as accurate if in place of librarian we were to substitute other job titles, 
such as cashier, file clerk, or dressmaker. Yet the requirements of these jobs 
are actually quite different. The job analyst must focus on specific require- 
ments that differentiate the particular job from other jobs. 

An even more important characteristic of an effective job analysis is that 
it indicate those aspects of performance that differentiate most sharply be- 
tween the better and the poorer workers. In many jobs, workers of different 
levels of proficiency may differ little in the way they carry out most parts 
of their jobs—only certain features of the job may bring out the important 
differences between successes and failures. In his classic book on Aptitude 
Testing, Hull stressed the importance of these differentiating aspects of job 
performance, which he called “critical part-activities” (30, p. 286). More 
recently, this concept has been reemphasized by Flanagan (19, 20) under 
the name of “critical requirements.” 

Although for the present purpose we are considering job analysis as the 
first step in developing a personnel selection program, it should be noted that 
it has other important uses in industry. It can serve as a basis for setting up 
or modifying training programs, improving operating procedures, reducing 
accident hazards, and redesigning equipment. Job analyses also help in 
arriving at job evaluations for adjusting pay rates. For this purpose, different 
jobs are rated with regard to certain major factors, such as training and ex-
perience required, physical and psychological demands, degree of responsi- 
bility, and working conditions. By assigning weights to these factors, one can 
obtain an over-all evaluation for each job, which is then used in setting up the 
wage scale.¹ Still another application of job analysis is to be found in the prepa- 
ration of job descriptions and classifications used in vocational counseling. 
More will be said about such sources of occupational information in Part VI. 
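
The weighting procedure just described amounts to a weighted sum of factor ratings. The following sketch is a minimal illustration under stated assumptions: the four factor names follow the text, but the weights, the 1-to-5 rating scale, the job ratings, and the function name `evaluate_job` are all hypothetical and are not taken from any published job-evaluation system.

```python
# Hypothetical weights for the four factors named in the text (invented values).
FACTOR_WEIGHTS = {
    "training_and_experience": 4,
    "physical_and_psychological_demands": 2,
    "responsibility": 3,
    "working_conditions": 1,
}

def evaluate_job(ratings, weights=FACTOR_WEIGHTS):
    """Return the over-all evaluation: the weighted sum of the factor ratings."""
    missing = set(weights) - set(ratings)
    if missing:
        raise ValueError(f"missing ratings for: {sorted(missing)}")
    return sum(weights[factor] * ratings[factor] for factor in weights)

# Invented ratings on an assumed 1-to-5 scale.
jobs = {
    "file clerk": {
        "training_and_experience": 2,
        "physical_and_psychological_demands": 1,
        "responsibility": 2,
        "working_conditions": 1,
    },
    "machinist": {
        "training_and_experience": 4,
        "physical_and_psychological_demands": 3,
        "responsibility": 3,
        "working_conditions": 3,
    },
}

scores = {title: evaluate_job(ratings) for title, ratings in jobs.items()}
# Jobs ordered from highest to lowest evaluation, as a basis for a wage scale.
ranked = sorted(scores, key=scores.get, reverse=True)

for title in ranked:
    print(f"{title}: {scores[title]}")
```

The resulting order of jobs depends entirely on the weights chosen, which is why the selection and weighting of factors is the central decision in any such system.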

For whatever purpose conducted, job analysis requires systematic data- 
gathering techniques if it is to yield usable information. When job informa- 
tion is sought in order to develop selection procedures, the available sources 
can be classified into seven types. First, the job analyst may consult pub- 
lished analyses of similar jobs. This procedure corresponds to the literature 
survey that customarily precedes any research project. When available, pub- 
lished job analyses may provide promising leads and suggestions, but their 
limitations are obvious. Although possibly similar, the previously analyzed 
jobs are rarely identical with the one under consideration. Moreover, the 
same title may connote very different jobs. Well-known examples are the 
job titles of “secretary” and “engineer,” each of which covers a multiplicity 
of widely varying jobs. Another difficulty is that verbal descriptions of job 
activities may have little meaning for the job analyst unless supplemented 
by more direct contacts with the job. 

1 For a discussion of different systems of job evaluation, see Otis and Leukart (28), 
Patton and Littlefield (39), and Tiffin and McCormick (47, ch. 17). 

A second source of job-analysis data is to be found in manuals and rec- 
ords that may be available for the particular job. These include instructional 
and operating manuals specifying the procedures to be followed by the 
worker in various phases of the job. Among the performance records that 
would be useful are those containing qualitative descriptions of common 
errors, learning difficulties, or reasons for failure. Similarly, clues as to 
characteristics conducive to failure on the job can be obtained from an 
analysis of exit interview reports. Held regularly with employees who are 
leaving the company, these interviews often cover the reasons for job termi- 
nation, whether initiated by the employer or employee. Records of customer 
complaints provide still another source of information about the problems 
encountered on a particular job. 

A third and very common type of job-analysis technique involves inter- 
views with workers, supervisors, or job instructors. Interviews with success- 
ful and unsuccessful workers, with employees having different amounts of 
job experience, and with learners at different stages of training are especially 
fruitful. Foremen, instructors, and other supervisors can often provide still 
other kinds of information. Such interviews have a number of advantages 
as job-analysis techniques. It is obvious that persons who have been carrying 
on the job are in a good position to report what occurs on such a job. More- 
over, the participation of workers and supervisors in the job analysis gives 
them a better understanding of what the psychologist is trying to do and 
increases their ultimate acceptance of the results of the project. On the other 
hand, the worker, however experienced in job activities, may have little in- 
sight into the reasons for his own or other workers’ success or failure. Nor 
can he readily translate job activities into worker qualifications. Semantic 
problems may also interfere with communication. Such terms as “good judg- 
ment,” “sense of humor,” or “dependability” may have very different con- 
notations for different persons. For all these reasons, such interviews should 
cover objective facts as much as possible, focusing on what workers do and 
what happens. 

An attempt to introduce more objectivity into workers' reports is illus- 
trated by a fourth technique, which may be described as a systematic activity 
log. Rather than relying on memory during an interview, the worker is re- 
quired to keep a running record of all the tasks he performs during a speci- 
fied period of time. Such a procedure reveals many activities that may be 
overlooked when one relies on the recollections of a small sample of workers, 
as was demonstrated in an early analysis of the job of secretary (11). First, 
the members of the research staff together with six secretaries recorded all 
the secretarial duties they could recall. The list they compiled contained 
166 items. Next a chart was prepared covering one working week, each 
day being divided into smaller time units. Copies of this chart were distri- 
buted to 125 secretaries with the instructions to record each activity in the 
appropriate space as it was performed. The resulting list revealed 871 different
duties, in contrast to the 166 on the first list.

Forgetting undoubtedly accounts for part of this difference. That variation 
in duties on different secretarial jobs may also contribute to the difference, 
however, was suggested by a third part of the study. A survey of 715 secretaries
with the complete checklist of 871 duties showed that only 3 per cent
of the duties were common to the reports of three-fourths or more of the 
secretaries. Systematic performance logs kept by a large sample of workers 
can undoubtedly reveal job information not readily obtainable in other ways. 
There are many jobs, however, in which the keeping of such records would 
not be practicable. 

A procedure that utilizes some of the features of the systematic activity 
log is the critical-incident technique described by Flanagan (20). Essentially,
this technique calls for factual descriptions of specific instances of
job behavior that are especially characteristic of either satisfactory or
unsatisfactory workers. It thus centers attention on the previously discussed critical
job requirements, leaving out of consideration job behavior that does not 
differentiate successful from unsuccessful workers. Depending on the na- 
ture of the job and other circumstances, the incidents may be reported by 
supervisors, co-workers, subordinates, or other associates. To ensure frank- 
ness in reporting, anonymity of the data should be preserved. The informa- 
tion is frequently obtained in an interview or in group-administered or 
mailed questionnaires. To avoid memory errors, the incidents may be re- 
corded as they occur over a designated period, such as two weeks. Growing 
out of Air Force research during World War II, the critical-incident tech- 
nique has been employed with such varied groups as commercial airline 
pilots, research personnel in physical science laboratories, factory workers, 
dentists, and department store salesclerks. 

A sixth technique that is always desirable—even if only to supplement 
other procedures—is direct observation by the job analyst. This technique 
may involve no more than unaided personal observation, as when the job 
analyst accompanies a house-to-house salesman or a credit investigator on his 
rounds. When applicable, sound recordings, motion pictures, and other ob- 
jective performance records may be utilized. In the analysis of manual opera- 
tions, the job analyst may use the techniques of time-and-motion study, to 
be discussed in Part III. To be sure, the mere presence of an observer may 
affect performance somewhat in many types of jobs. Nevertheless, this type 
of observation provides the job analyst with a fuller and more vivid picture 
of job activities than could be obtained through more indirect sources. 

For the same reason, a seventh source of information is sometimes utilized, 
namely, performance of the job by the analyst. Obviously, the job analyst 
can spend only a short time working on any one job. Moreover, this tech- 
nique is not feasible with certain kinds of jobs, such as those requiring 
special skills or long training periods or those involving risk from an in- 
experienced operator. Early examples of job analyses based on extensive job 
performance include studies of department store salesclerks (6), taxi drivers, 
streetcar motormen, and assembly-line operators (see 30, p. 286). Today,

PREPARATION AND PRELIMINARY
TRYOUT OF PREDICTORS 


Job analysis enables the personnel psychologist to define the problem by 
specifying the critical worker requirements for success in the given job. Such 
an analysis may suggest, for example, that the job demands finger dexterity, 
perceptual speed, computational accuracy, and the ability to work effectively 
under confusing and distracting conditions. The next step is to prepare 
tests, questions, rating scales, or other selection devices designed to assess 
the applicant's standing in these traits. All such selection instruments are 
known as predictors, since they are used to predict each individual's subse- 
quent job performance. 

The predictors assembled at this stage may include previously available 
published tests, instruments developed specially for this project, or a combi- 
nation of the two. In choosing a predictor or in preparing a new one, the 
psychologist is in effect formulating a series of hypotheses, namely, that the 
score on each test or the answer to each question on an application or recom- 
mendation form is significantly related to eventual success on the given job. 
The correctness of these hypotheses depends upon several factors, such as the 
effectiveness of the job analysis in bringing out important job requirements, 
the proper utilization of previous research data obtained with available
instruments in similar job situations, the psychologist's ingenuity in devising
new instruments, and his general familiarity with psychological facts and
principles. But although much can be done in advance to increase the chances
that correct hypotheses will be formulated, it is only by an empirical follow-up 
that their correctness can be demonstrated. 

Before embarking on a full-scale follow-up of the predictors, however, the 
psychologist will usually submit them to preliminary tryouts. Some predictors 
may be discarded and others modified as a result of such tryouts. Using groups 
similar to the applicant population for which the predictors are designed, we 
can check on time required, clarity of instructions, suitability of vocabulary 
and reading level, layout of printed matter, and similar questions. 

Selection programs usually include some aptitude tests. In the preliminary 
tryout of such tests, it is particularly important to obtain information regard- 
ing their dispersion, difficulty level, reliability, and specificity. The first two 
characteristics can be checked by examining the distribution of scores obtained 
by a representative sample of persons. Obviously, if everyone achieves the 
same or nearly the same score, the test is of little use. The object of a test is 
to discriminate among individuals. Thus, the wider the variability or dis- 
persion of scores, the better is the test fulfilling its function. Similarly, if the 
difficulty level of the test is suitable for the group, the scores will tend to 
fall into a symmetrical distribution, with the largest number near the center 
of the range and a decreasing number as the extremes are approached. With 
large samples, the distribution approximates the bell-shaped normal prob- 
ability curve (see Figure 2, p. 30). If, however, the test is too difficult for 
the group, the scores will pile up at the low end; if it is too easy, they will pile 
up at the high end. In either case, the distributions will be appreciably skewed 
rather than normal. 
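
The checks on dispersion and difficulty level described above can be sketched in a few lines of Python. This is only an illustration under stated assumptions: the function name `describe_scores`, the tryout scores, and the choice of the moment coefficient of skewness as the index of asymmetry are all hypothetical and are not taken from the text.

```python
import math

def describe_scores(scores):
    """Return (mean, SD, skewness) for a list of test scores.

    SD is the square root of the mean squared deviation from the mean.
    Skewness is the mean cubed deviation divided by SD cubed (an assumed
    index; the text names no particular one). Positive skewness means the
    scores pile up at the low end (test too difficult); negative skewness
    means they pile up at the high end (test too easy).
    """
    n = len(scores)
    mean = sum(scores) / n
    sd = math.sqrt(sum((x - mean) ** 2 for x in scores) / n)
    if sd == 0:
        # Everyone earned the same score: the test does not discriminate.
        return mean, 0.0, 0.0
    skew = sum((x - mean) ** 3 for x in scores) / (n * sd ** 3)
    return mean, sd, skew

# Hypothetical tryout data for a short test.
too_hard = [0, 0, 1, 1, 1, 2, 2, 3, 5, 8]
suitable = [2, 3, 4, 4, 5, 5, 6, 6, 7, 8]
too_easy = [10 - x for x in too_hard]

for label, data in [("too hard", too_hard), ("suitable", suitable), ("too easy", too_easy)]:
    mean, sd, skew = describe_scores(data)
    print(f"{label:9s} mean={mean:.2f} SD={sd:.2f} skewness={skew:+.2f}")
```

A skewness near zero together with a reasonably large SD is what a suitable test should show; with samples this small the figures are merely illustrative.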

The reliability of a test refers to its consistency. Will individuals retain the 
same relative position in the group if examined on two occasions with the 
same finger dexterity test? Obviously, if there were broad shifts in position, so
that the person who was best in the group on one day scored near the bottom 
when retested on another day, the test could have no value as a predictor of 
subsequent job performance. As suggested by the above example, the temporal 
stability of a test can be determined by comparing the scores obtained by the 
same subjects on two administrations of the test. 

Another aspect of test reliability pertains to consistency of scores on parallel 
forms of the test. Although designed to measure the same ability, such parallel 
forms are composed of different items. For example, two forms of a vocabulary 
test might yield slightly different scores because some individuals happen to 
know more words on one list, while others know more words on the other 
list. This type of reliability can be measured by comparing the scores obtained 
by the same persons on two forms of a test. When two forms are not available, 
the reliability is often estimated by comparing scores on two halves of a single 
form, such as the scores on odd and even items. 
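
The odd-even comparison just mentioned can be illustrated as follows. The sketch assumes items scored 1 (right) or 0 (wrong); the names `odd_even_scores` and `pearson_r` and the response data are hypothetical. The same correlation routine would serve for retest or parallel-form comparisons, given two sets of scores from the same persons.

```python
import math

def pearson_r(x, y):
    """Pearson correlation between two equal-length lists of scores."""
    n = len(x)
    mean_x, mean_y = sum(x) / n, sum(y) / n
    sxy = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    sxx = sum((a - mean_x) ** 2 for a in x)
    syy = sum((b - mean_y) ** 2 for b in y)
    return sxy / math.sqrt(sxx * syy)

def odd_even_scores(item_matrix):
    """Split each person's item responses into odd-item and even-item half scores."""
    odd = [sum(row[0::2]) for row in item_matrix]   # items 1, 3, 5, ...
    even = [sum(row[1::2]) for row in item_matrix]  # items 2, 4, 6, ...
    return odd, even

# Hypothetical responses of six persons to an eight-item test (one row per person).
responses = [
    [1, 1, 1, 1, 1, 1, 1, 1],
    [1, 1, 1, 1, 1, 1, 1, 0],
    [1, 1, 1, 1, 1, 0, 0, 0],
    [1, 1, 1, 0, 0, 0, 0, 0],
    [1, 1, 0, 0, 0, 0, 0, 0],
    [0, 0, 0, 0, 0, 0, 0, 0],
]

odd, even = odd_even_scores(responses)
r_half = pearson_r(odd, even)
print("odd-item scores: ", odd)
print("even-item scores:", even)
print(f"correlation between halves: {r_half:.2f}")
```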

In the event that the test cannot be objectively scored but requires evalua- 
tion or interpretation of responses by the examiner, a further type of reliability 
that should be checked is scorer reliability. This can be done by having a 
sample of papers independently scored by two examiners and computing the 
degree of scorer agreement between the two sets of scores. Scorer reliability 
is more likely to represent a problem in certain types of personality tests than 
in aptitude tests, although it needs to be considered in some of the latter too. 

Whatever the type of reliability under consideration, it can be expressed 
in terms of a correlation coefficient.* This statistical technique, which is used
for many purposes in psychological research, shows the degree of relationship
or correspondence between two sets of scores. The correlation coefficient,
conventionally designated by the symbol r, provides a single index of the
closeness and direction of relationship within the whole group. It can vary from
+1.00 (a perfect positive correlation), through 0, to -1.00 (a perfect negative,
or inverse, correlation). A +1.00 correlation means that the individual receiving
the highest score in one variable also receives the highest score in the
other, the one who is second best in the first is second best in the second, etc.,
each person falling just as far above or below the mean in one variable as he
does in the other. A -1.00 correlation, on the other hand, indicates that the
highest score in one variable is paired off with the lowest in the other, a
corresponding perfect reversal occurring throughout the group. A zero correlation
signifies no relationship at all between the two sets of scores, or the
sort of arrangement that would result if the scores were shuffled and paired
off at random.
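
For readers who want to see the coefficient computed, the following sketch uses the standard product-moment formula. The function name `pearson_r` and the four short score lists are hypothetical, chosen so that the perfect positive, perfect negative, and intermediate cases are easy to verify by hand.

```python
import math

def pearson_r(x, y):
    """Product-moment correlation between two equal-length lists of scores."""
    n = len(x)
    mean_x, mean_y = sum(x) / n, sum(y) / n
    # Sum of cross-products of each person's deviations from the two means.
    sxy = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    sxx = sum((a - mean_x) ** 2 for a in x)
    syy = sum((b - mean_y) ** 2 for b in y)
    return sxy / math.sqrt(sxx * syy)

first_variable = [1, 2, 3, 4, 5]    # hypothetical scores of five persons
same_order = [2, 4, 6, 8, 10]       # identical relative standing
reversed_order = [10, 8, 6, 4, 2]   # complete reversal of standing
partly_shuffled = [2, 1, 4, 3, 5]   # neighbors exchanged in two places

print(pearson_r(first_variable, same_order))
print(pearson_r(first_variable, reversed_order))
print(pearson_r(first_variable, partly_shuffled))
```

The first two calls give the limiting values of +1.00 and -1.00; the third gives .80, since most persons keep roughly the same relative standing.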

The correlations found in actual practice are almost never equal to +1.00
or -1.00, but fall on some intermediate value. The numerical size of the
coefficient represents the closeness of relationship, in either a positive or 
negative direction. An example of high positive correlation is given in Figure 1. 
This figure shows a “bivariate distribution,” indicating the scores obtained by 
each of 100 persons on Forms A and B of a test. Each tally mark shows the 
score of one subject on both forms. It can be seen, for example, that seven 


* Procedures for computing correlation coefficients can be found in any elementary 
statistics book, such as those cited in Chapter 1 (p. 9). For further discussion of test
reliability, see any recent text on psychological testing, such as Anastasi (2) or Cron 
Fig. 1 Bivariate distribution yielding a correlation of .94.


persons obtained scores between 30 and 39 on both forms, while three scored 
between 30 and 39 on Form A but between 40 and 49 on Form B. When 
computed, the correlation between the two forms proves to be .94. Such a 
correlation indicates a strong tendency for individuals to maintain the same 
relative standing on the two forms, although it can be seen that the
correspondence is not perfect.

Correlation is also useful in checking the specificity of different tests. 
“Specificity” refers to the fact that the tests used in a selection program should 
measure separate and independent abilities in order to avoid wasteful duplica- 
tion. Each test should be chosen to cover a different aspect of the job per- 
formance that is to be predicted. A test yielding a single score based on a 
combination of several aptitudes would be less useful than a set of separate 
tests measuring relatively pure and independent functions. The latter arrange- 
ment permits a more efficient and flexible use of data. 

Specificity can be checked by correlating the scores obtained by a group 
of persons on the different tests. When two or more tests correlate too highly 
among themselves, one of them can be chosen on the basis of other considerations,
such as dispersion, difficulty level, and reliability. To be sure, two or
even more tests covering the same function may be included at this stage if 
the object is to see which will prove best in the subsequent follow-up. The
choice among them would then be postponed until the next step. In any 
event, more predictors of all sorts should be included at the outset than we 
ultimately plan to retain, so that we may adopt for operational use those that 
prove most effective in the follow-up. 
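
A check on specificity of the kind described above might be sketched as follows. The test names, the scores, and in particular the cutoff of .80 for "too highly" correlated are assumptions made for the example; the text sets no numerical standard.

```python
import math
from itertools import combinations

def pearson_r(x, y):
    """Pearson correlation between two equal-length lists of scores."""
    n = len(x)
    mean_x, mean_y = sum(x) / n, sum(y) / n
    sxy = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    sxx = sum((a - mean_x) ** 2 for a in x)
    syy = sum((b - mean_y) ** 2 for b in y)
    return sxy / math.sqrt(sxx * syy)

def overlapping_tests(scores_by_test, threshold=0.80):
    """Return (test_1, test_2, r) for each pair correlating at or above the threshold."""
    flagged = []
    for (name_1, s1), (name_2, s2) in combinations(scores_by_test.items(), 2):
        r = pearson_r(s1, s2)
        if r >= threshold:  # hypothetical cutoff for wasteful duplication
            flagged.append((name_1, name_2, r))
    return flagged

# Hypothetical tryout scores of the same six persons on three tests.
tryout_scores = {
    "finger_dexterity_a": [10, 12, 14, 16, 18, 20],
    "finger_dexterity_b": [11, 12, 15, 15, 19, 20],
    "vocabulary": [30, 22, 28, 20, 27, 25],
}

flagged = overlapping_tests(tryout_scores)
for name_1, name_2, r in flagged:
    print(f"{name_1} and {name_2} overlap: r = {r:.2f}")
```

Only the two dexterity tests are flagged here. Which of the pair to keep would then rest on the other considerations named in the text, or be left to the follow-up.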


VALIDATION OF PREDICTORS 


It is apparent that the preliminary tryouts discussed in the preceding section 
involve the testing of several hypotheses regarding difficulty level, reliability, 
and other features of the predictive instruments. The major task of hypothesis 
testing, however, is concerned with the relationship between predictors and
job performance. All other hypotheses regarding predictors are secondary. The 
job performance that we set out to predict is known as the criterion. The 
validity of a predictor is its correlation with the criterion. Validation consists 
essentially in the administration of the predictors to a representative sample 
of job applicants or newly hired employees and the checking of each
individual's initial scores against his subsequent criterion status. The intervening
period may be six months, a year, or several years, depending upon the nature 
of the job and available facilities. Although several types of validity are 
recognized in test construction, predictive validity is the most relevant to 
personnel selection (see 2, ch. 6; 45). 

When limitations of time or other circumstances make a longitudinal study 
impracticable, validity may be determined by administering the predictors to 
present employees and checking scores against current criterion status. This 
procedure, yielding a measure of concurrent validity, is not so satisfactory for 
the validation of personnel selection instruments and must be regarded as a 
makeshift. Since unsatisfactory employees tend to leave, a group of present 
employees is not a representative sample of the applicant population on which 
the instruments are to be used. Moreover, present employees have had varying 
amounts of experience on the job, and such experience may affect their test 
performance. 

The Criterion. A selection program can be no better than the criterion 
against which it is validated. Industrial psychologists regard the criterion
problem as of prime importance, at the same time recognizing that criterion
measures employed in most personnel research leave much to be desired. Job 
success is a complex concept, and it can be assessed in many ways. Production 
records provide a criterion measure available for many kinds of jobs. Quantity 
and quality of output, waste through spoilage of materials, amount of sales, 
and merchandise returned are among the records that can be used for this 
purpose. Length of time the individual remains on the job represents a more 
comprehensive criterion measure, especially important in jobs having rapid 
turnover. Whether an employee is discharged for incompetence or whether he 
leaves spontaneously, he contributes to turnover costs. In certain jobs, per- 
formance in training provides useful criterion data. Other sources include 
records of accidents, absenteeism, tardiness, and disciplinary actions. 

Regardless of how much objective information may be available, it is always 
desirable to include ratings among criterion measures. Depending upon the
nature of the job, such ratings may be procured from supervisors, co-workers, 
subordinates, instructors, or other personnel. Many companies have merit rating
programs as part of their administrative procedures. In other companies, 
ratings may have to be secured specially for criterion purposes. If obtained 
under satisfactory conditions, ratings represent a particularly good source of 
criterion data. Specific rating techniques, as well as ways of reducing common 
judgment errors, will be discussed in Chapter 4. 

With any type of criterion data, we must guard against criterion contam- 
ination. Production records, for instance, may be influenced by extraneous 
factors. Thus one factory worker may turn out fewer articles than another
because he has poorer equipment, or one salesman may sell more life insurance 
than another because he operates in a higher-income territory. Either the 
criterion measures should be obtained solely from individuals working under 
similar conditions, or they should be adjusted in some way for existing 
inequalities.

Criterion contamination may also occur through the influence of the 
predictors themselves upon the criterion. For example, if a foreman knows 
that one employee scored low and another high on an aptitude test, this 
knowledge may affect the ratings he assigns them for job performance. Such 
influences will artificially inflate the correlation between test scores and 
criterion. It is therefore imperative that anyone who participates in the assign- 
ment of criterion ratings have no access to test scores or other predictor data. 
While the predictors are being validated, they must not be used for opera- 
tional purposes in selecting or evaluating personnel. 

For a realistic and comprehensive picture of each employee’s job perform- 
ance, several sources of criterion data should be combined. The simplest way 
to do so is to express all criterion scores in comparable units and add them to 
find an over-all criterion index for each person. In this way, for instance, we 
could combine information about quantity of output, number of rejects, ab- 
senteeism, and foreman’s ratings. To convert scores into comparable units, we 
may express them as standard scores, so called because they use as their unit 
the standard deviation of the group. The standard deviation is a measure of 
variability or extent of individual differences within the group. It is computed 
by finding each person’s deviation from the group mean and squaring it. The 
standard deviation (SD) is the square root of the mean of these squared 
deviations. 

Standard scores express each individual’s standing in the group as so many 
SD's above or below the group mean. Thus, if the mean is 30 and the SD 5,
an individual with a score of 35 falls 1 SD above the mean and receives a 
standard score of +1.00. Figure 2 shows where these successive SD units fall 
in a normal curve. Reference to this figure indicates, for instance, that an 
individual with a standard score of +1.00 exceeds the lower 84.13 per cent 
of the group (50 per cent below mean plus 34.13 per cent between mean 
and 1 SD). Similarly, by consulting tables of normal curve frequencies, we 
may find the percentage of cases exceeded by standard scores of +1.5, -0.8,
+2.1, or any other given value. These positions will be equivalent in different 
distributions, provided the distributions approximate the normal curve. 
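
The computations described in the last two paragraphs can be sketched as follows. The function names are hypothetical, and the normal-curve percentage is obtained from the error function in Python's standard library rather than from a printed table.

```python
import math

def standard_deviation(scores):
    """Square root of the mean of the squared deviations from the group mean."""
    mean = sum(scores) / len(scores)
    return math.sqrt(sum((s - mean) ** 2 for s in scores) / len(scores))

def standard_score(raw, mean, sd):
    """Distance of a raw score from the mean, expressed in SD units."""
    return (raw - mean) / sd

def percent_exceeded(z):
    """Percentage of a normal distribution falling below standard score z."""
    return 50 * (1 + math.erf(z / math.sqrt(2)))

# The example from the text: mean 30, SD 5, raw score 35.
z = standard_score(35, mean=30, sd=5)
print(f"standard score = {z:+.2f}")
print(f"per cent of group exceeded = {percent_exceeded(z):.2f}")

# The other standard scores mentioned in the text.
for value in (1.5, -0.8, 2.1):
    print(f"{value:+.1f} exceeds {percent_exceeded(value):.2f} per cent")
```

As the text notes, these percentages hold only to the extent that the distribution of scores approximates the normal curve.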

Different criteria usually vary in importance with reference to the objectives 


SD is also symbolized by the Greek letter sigma (σ).



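[Figure 2: normal curve with the baseline marked at -3 SD, -2 SD, -1 SD, Mean,
+1 SD, +2 SD, and +3 SD. Areas shown: 34.13% between the mean and 1 SD on
either side; 13.59% between 1 SD and 2 SD; 68.26% within 1 SD of the mean;
95.44% within 2 SD; 99.72% within 3 SD.]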


Fig. 2 Standard scores in a normal curve. 


of the job. Accordingly, weights may be assigned to each standard score by 
multiplying it by a constant before adding. Still another way of combining 
criterion scores is in terms of their estimated dollar value—the so-called 
"dollar criterion" (9). This procedure applies cost accounting techniques to 
criterion measures. For example, rate of production, amount of material wasted, 
and time lost by absences can each be expressed in terms of cost to the company 
and these dollar values added. A difficulty arises in assigning dollar values to 
such factors as customer goodwill or departmental morale. But in trying to 
assess these intangible effects of an employee's behavior, the dollar estimates 
may be no more subjective than ratings expressed in other quantitative terms. 
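
The weighted combination of standard scores described at the start of this paragraph can be sketched as follows. The criterion measures, the figures for four employees, and the weights are all hypothetical. The negative weight for rejects is an assumption of the example, introduced so that a higher composite always means better performance.

```python
import math

def standard_scores(values):
    """Convert raw scores to standard scores within the group."""
    n = len(values)
    mean = sum(values) / n
    sd = math.sqrt(sum((v - mean) ** 2 for v in values) / n)
    return [(v - mean) / sd for v in values]

def composite_criterion(measures, weights):
    """Weighted sum of standard scores over all measures, one value per employee."""
    n_people = len(next(iter(measures.values())))
    totals = [0.0] * n_people
    for name, values in measures.items():
        for i, z in enumerate(standard_scores(values)):
            totals[i] += weights[name] * z
    return totals

# Hypothetical criterion data for four employees.
measures = {
    "units_produced": [100, 120, 90, 110],
    "rejects": [5, 2, 8, 5],
    "foreman_rating": [3, 5, 2, 4],
}
# Hypothetical weights; rejects count against the employee.
weights = {"units_produced": 2.0, "rejects": -1.0, "foreman_rating": 1.0}

index = composite_criterion(measures, weights)
for person, value in enumerate(index, start=1):
    print(f"employee {person}: composite criterion = {value:+.2f}")
```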

Determining Validity. The validity of each predictor may be determined 
by analyzing the relationship between predictor scores and criterion scores. 
Suppose the validation sample includes two contrasted groups of life insurance 
salesmen, one composed of highly successful salesmen, the other of near
failures. With such a dichotomous, or twofold, criterion, we may compare the 
mean predictor scores obtained by the two groups. If the successful salesmen 
achieve higher predictor scores than the unsuccessful, then the predictor has 
some validity for this criterion. 

To answer the question conclusively, we need to determine the significance 
of the difference between the means of the two groups. Measures of statistical 
significance are widely used in psychological research, being required in 
nearly all types of investigations. Statistical significance refers essentially to 
the chances that similar results will be obtained if the investigation is repeated. 
How likely is it that the successful salesmen would again score higher than 
the unsuccessful if another sample were tested and followed up? A certain 
amount of variation in results will always occur through sampling error, as 
well as through the unreliability of measuring instruments discussed in an 
earlier section. Every investigation utilizes only a sample of the total relevant 
population. In personnel selection research, for instance, the sample on which 
validity is determined is never the same as the groups on whom the predictors 
will be used operationally. How far, then, can we generalize from the valida- 
tion sample to other samples? 


To find the statistical significance of a mean difference, we need to know 
Fig. 3 Expectancy table showing relation between pilot stanine and
elimination from primary flight training. (From Flanagan, 18, p. 58.)


the obtained difference and the SD's of the two groups compared. From these 
data we can compute the sampling fluctuation to be expected for each mean 
separately, as well as for the difference between them. The end product of 
these computations is usually a ratio (critical ratio, or t ratio) between the 
difference and the standard error of the difference. With such a ratio, we can 
look up the probability (P) that the difference is small enough to have 
resulted from chance errors. If this probability is less than 1 out of 100
(P < .01), we generally conclude that the difference is large enough to be
significant. In other words, we conclude that in other samples the difference
would still favor the same group (e.g., the successful salesmen). The chances
of such a conclusion being wrong are less than 1 out of 100. It might be
added that a lower level of significance, such as .05, is sometimes accepted.
In every investigation, however, the level used is specified. 
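
The critical ratio described above can be computed as in the following sketch. Several assumptions are made that the text does not specify: the two groups are independent, the standard error of each mean is taken as SD divided by the square root of N, and P is read from the normal curve (two-tailed), which is reasonable only for fairly large samples. The group statistics are hypothetical.

```python
import math

def critical_ratio(mean_1, sd_1, n_1, mean_2, sd_2, n_2):
    """Return (ratio, P) for the difference between two independent group means."""
    se_1 = sd_1 / math.sqrt(n_1)                # sampling fluctuation of first mean
    se_2 = sd_2 / math.sqrt(n_2)                # sampling fluctuation of second mean
    se_diff = math.sqrt(se_1 ** 2 + se_2 ** 2)  # standard error of the difference
    ratio = (mean_1 - mean_2) / se_diff
    # Two-tailed probability of a ratio this large under the normal curve.
    p_value = math.erfc(abs(ratio) / math.sqrt(2))
    return ratio, p_value

# Hypothetical predictor scores: 50 successful and 50 unsuccessful salesmen.
ratio, p_value = critical_ratio(62, 10, 50, 55, 12, 50)
print(f"critical ratio = {ratio:.2f}, P = {p_value:.4f}")
print("significant at the .01 level" if p_value < 0.01 else "not significant at the .01 level")
```

With small samples the probability should be taken from a table of t rather than from the normal curve.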

A finer breakdown of the relation between predictor and criterion may be 
presented in the form of an expectancy table, illustrated in Figure 3. This 
chart shows the percentages of student pilots who failed primary flight train- 
ing within each stanine on the pilot selection battery developed by the Air 
Force. A stanine (contraction of “standard nine”) is essentially a standard 
score expressed on a nine-point scale. In such a scale, the mean is 5, and the 
SD is approximately 2. Figure 3 shows that 77 per cent of the men with the 
lowest stanine score (1) were eliminated, while only 4 per cent of those 
receiving the highest stanine (9) failed to complete the course satisfactorily.
The percentage of failures decreases consistently over the intervening stanines. 
The percentages given in Figure 3 provide an estimate of the probability that 
student pilots tested subsequently would pass or fail. For example, it can be 
predicted that approximately 40 per cent of candidates scoring 4 will fail and 
60 per cent will pass. Similar statements can be made about the expectancy 
of success or failure on the part of individuals. Thus an individual with a 
pilot stanine of 4 has a 60:40, or 3:2, chance of completing primary flight 
training. 
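
An expectancy table of this kind is simply a tabulation of outcomes within each predictor score. In the sketch below, the function name `expectancy_table` and the records are hypothetical; the counts are invented for illustration and are not Flanagan's data.

```python
from collections import defaultdict

def expectancy_table(records):
    """Map each predictor score to the percentage of cases that failed.

    `records` is a list of (score, passed) pairs, where `passed` is True or False.
    """
    counts = defaultdict(lambda: [0, 0])  # score -> [failures, total cases]
    for score, passed in records:
        counts[score][1] += 1
        if not passed:
            counts[score][0] += 1
    return {score: 100 * failed / total
            for score, (failed, total) in sorted(counts.items())}

# Hypothetical follow-up records for three stanine levels.
records = (
    [(1, False)] * 3 + [(1, True)] * 1
    + [(4, False)] * 4 + [(4, True)] * 6
    + [(9, False)] * 1 + [(9, True)] * 19
)

table = expectancy_table(records)
for stanine, percent_failed in table.items():
    print(f"stanine {stanine}: {percent_failed:.0f} per cent eliminated, "
          f"{100 - percent_failed:.0f} per cent passed")
```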

The most precise expression of the relation between predictor and criterion 
is provided by a validity coefficient. Such a coefficient is customarily com- 
puted when both predictor and criterion scores are in the form of continuous 
variables. In that case, the relation can be represented in a bivariate distribu- 
tion such as that illustrated in Figure 1 and the correlation computed by the 
same method. Other types of correlations, however, can be employed when 
predictor, criterion, or both are in the form of dichotomies or of multiple 
categories. These techniques are described in standard statistics texts. Like all 
statistical measures, validity coefficients must be evaluated for significance. If 
we find that a validity coefficient is significant at the .01 level, we conclude
that it is too high to have resulted from chance errors and we accept it as 
evidence of genuine validity. 

The higher the validity coefficient, of course, the more valid the predictor. 
The practical implications of a validity coefficient may be expressed in several 
ways. For example, we may want to know what effect the use of a selection 
test will have on the percentage of employees who exceed a minimum standard 
of job success. Suppose that the percentage of successful employees prior to 
the use of the test was 60, that the supply and demand is such that we must
hire 30 per cent of all applicants for the job (selection ratio = .30), and that
the validity coefficient of the test is .45. By reference to the appropriate table
(see 2, pp. 162-164; 44), we find that under these conditions the percentage
of successful employees following the use of the test should rise from 60 to 80.
Techniques are also available for estimating what effect the use of a predictor 
of given validity will have on the over-all output level of the selected workers 
(see 2, pp. 164-166; 10). It has been demonstrated that the expected increase 
in output is directly proportional to the validity of the test (7). Thus the 
improvement resulting from the use of a test whose validity coefficient is .45 
will be approximately 45 per cent as great as the improvement expected from 
a test of perfect validity. 
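
The first of these estimates can be approximated numerically instead of being read from a table. The sketch below assumes that predictor and criterion scores follow a bivariate normal distribution (an assumption of the sketch), and that the validity coefficient lies strictly between -1 and +1. The function name and the midpoint-rule integration are illustrative choices, not a procedure given in the text.

```python
import math
from statistics import NormalDist

_normal = NormalDist()  # standard normal curve: mean 0, SD 1

def success_rate_among_selected(base_rate, selection_ratio, validity, steps=4000):
    """Estimate the proportion of successful employees among those selected.

    base_rate:       proportion successful without the test (e.g., .60)
    selection_ratio: proportion of applicants hired, taken from the top (e.g., .30)
    validity:        correlation between predictor and criterion (e.g., .45)
    """
    x_cut = _normal.inv_cdf(1 - selection_ratio)  # predictor cutoff, in SD units
    y_cut = _normal.inv_cdf(1 - base_rate)        # criterion cutoff, in SD units
    spread = math.sqrt(1 - validity ** 2)         # SD of criterion at a fixed predictor score
    upper = 8.0                                   # practical upper limit of integration
    width = (upper - x_cut) / steps
    joint = 0.0
    for i in range(steps):
        x = x_cut + (i + 0.5) * width             # midpoint of each narrow strip
        p_success = 1 - _normal.cdf((y_cut - validity * x) / spread)
        joint += _normal.pdf(x) * p_success * width
    return joint / selection_ratio

rate = success_rate_among_selected(base_rate=0.60, selection_ratio=0.30, validity=0.45)
print(f"expected proportion successful among those hired: {rate:.2f}")
```

For the values used in the text, the result falls close to the 80 per cent read from the table. With a validity of zero the function returns the base rate, as it should.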

Cross Validation. When a set of predictors has been selected on the basis 
of their individual validities, the validity of the set as a whole should be 
checked in a new sample. Such an independent determination of validity in a 
second comparable sample is known as “cross validation” (37). This step is 
especially important when we start with a large number of predictors, such as 
individual items for an application form, and retain a small proportion for final 
use. Cross validation is also advisable in making up a battery of tests, espe- 
cially if a relatively large number of the original tests are discarded because 
of low validities. The validity of the composite—be it test battery, application 
form, or total test whose items were individually validated—should not be 
computed on the same sample used for choosing the individual predictors. If 
it were so computed, the validity coefficient would capitalize on chance errors 
within the original sample and would consequently be spuriously high. In
fact, a high validity coefficient will result under these circumstances, even 
when the set of predictors has no validity at all for the particular criterion. 

Let us suppose that, out of a sample of 100 salesclerks, the 30 with the
highest and the 30 with the lowest sales records are chosen to represent contrasted
criterion groups. If, now, these two groups are compared in a number
of traits actually irrelevant to success in selling, certain chance differences will 
undoubtedly be found. Thus there might be an excess of urban-born and of 
red-haired salesclerks within the upper criterion group. If we were to assign 
each individual a "score" by crediting him with one point for urban residence 
and one point for red hair, the mean of such scores would undoubtedly be 
higher in the upper than in the lower criterion group. This is not evidence for 
the validity of the predictors, however, since such a validation process is based 
upon a circular argument. The two predictors were chosen in the first place 
on the basis of the chance variation that occurred in this particular sample. 
And the same chance differences operate to produce the mean differences in 
total score. When tested in another sample, the chance differences in fre- 
quency of urban residence and red hair are likely to disappear or be reversed. 
Consequently, the validity of the scores will collapse. 

A specific illustration of the need for cross validation is provided by an 
investigation of the Rorschach inkblot test (31). In an attempt to determine 
whether the Rorschach could be of any help in selecting sales managers for 
life insurance agencies, this test was administered to 80 managers, including
an upper criterion group of 42 considered very satisfactory by their companies
and a lower criterion group of 38 considered unsatisfactory. The 80 test
records were studied by a Rorschach expert, who selected 32 signs, or response
characteristics, occurring more frequently in one criterion group than in the
other. A scoring key was developed in which signs occurring more often in the
upper group were scored +1, those more frequent in the lower group -1.
When this scoring key was reapplied to the original 80 managers, 79 of the
80 were correctly classified in the upper or lower group. The correlation between
test score and criterion would thus have been close to 1.00. However,
when the test was cross-validated on a second comparable sample of 41
managers, 21 in the upper and 20 in the lower group, the validity coefficient
dropped to a negligible .02.

That such results can be obtained under pure-chance conditions was 
demonstrated by Cureton (14), who determined the "scores" of 29 students 
on each of 85 nonexistent test items by shaking 85 numbered tags and letting 
them fall, once for each student. If a tag fell numbered side up, the item was 
scored right; if blank side up, it was scored wrong. An item analysis conducted 
with these chance scores against the students' grade-point averages yielded a 
24-item “test.” Total scores on this test correlated .82 with the criterion of
college grades in the original group of 29 students. This correlation, of course, 
does not demonstrate validity, but the operation of pure chance—or, to use 
Cureton's more vivid term, "baloney." 
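
Cureton's demonstration is easy to repeat by simulation. The sketch below is not his procedure in detail: the item-selection rule (keep the 24 items with the largest absolute item-criterion correlation, each keyed by the sign of that correlation), the random seed, and all function names are assumptions made for illustration. It builds a scoring key from pure-chance items in one sample of 29 cases and then applies the same key to a fresh sample.

```python
import random

def pearson(x, y):
    """Pearson product-moment correlation between two equal-length lists."""
    n = len(x)
    mean_x, mean_y = sum(x) / n, sum(y) / n
    sxy = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    sxx = sum((a - mean_x) ** 2 for a in x)
    syy = sum((b - mean_y) ** 2 for b in y)
    if sxx == 0 or syy == 0:
        return 0.0  # a constant variable cannot correlate with anything
    return sxy / (sxx * syy) ** 0.5

def chance_sample(rng, n_students, n_items):
    """Random right/wrong item scores plus an unrelated criterion (e.g., grades)."""
    items = [[rng.randint(0, 1) for _ in range(n_items)] for _ in range(n_students)]
    criterion = [rng.gauss(0, 1) for _ in range(n_students)]
    return items, criterion

def build_key(items, criterion, keep):
    """Item analysis: retain the items most strongly related to the criterion."""
    n_items = len(items[0])
    r = [pearson([row[j] for row in items], criterion) for j in range(n_items)]
    best = sorted(range(n_items), key=lambda j: abs(r[j]), reverse=True)[:keep]
    return {j: (1 if r[j] >= 0 else -1) for j in best}

def total_scores(items, key):
    """Score each person on the retained items only, using the keyed direction."""
    return [sum(sign * row[j] for j, sign in key.items()) for row in items]

rng = random.Random(0)  # fixed seed so the run is repeatable

# Original sample: the key is derived and checked on the same 29 cases.
items_a, criterion_a = chance_sample(rng, 29, 85)
key = build_key(items_a, criterion_a, keep=24)
r_original = pearson(total_scores(items_a, key), criterion_a)

# Cross validation: the same key applied to 29 new cases.
items_b, criterion_b = chance_sample(rng, 29, 85)
r_cross = pearson(total_scores(items_b, key), criterion_b)

print("original sample:", round(r_original, 2))
print("cross-validation sample:", round(r_cross, 2))
```

Because every retained item is keyed in the direction that happened to favor it, the correlation in the original sample is necessarily positive and usually large, whereas in the new sample it can be expected to hover around zero.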

Synthetic Validity. The same test may have high validity for office clerks 
or machinists in one company and low or negligible validity for jobs bearing 
the same title in another company. Surveys of validity coefficients against 
occupational criteria reveal wide variations. In one of the most comprehensive 
of these surveys, Ghiselli (22) reports, for example, that intelligence tests 
correlated from -.40 to +.80 with proficiency criteria for general clerks; and 
spatial relations tests correlated from -.55 to +.65 with job performance of 
machine tenders. Of course, some of this variation is to be expected because of 
differences among tests designed to measure intelligence or spatial relations, 
as well as differences in the composition of the samples tested. But the range 
of validity coefficients found is far greater than could be explained in these 
terms. Differences in the criteria themselves are undoubtedly a major factor 
in the variation observed among validity coefficients. Thus the duties of office 
clerks or of mechanics may differ widely among companies or among depart- 
ments in the same company. 

Test validities also vary when checked against different criteria of success 
in a single job. For example, the tests that predict performance in training 
may fail to predict subsequent job proficiency, or vice versa (22). Similarly, 
a test may fail to correlate significantly with supervisors' ratings and yet show 
appreciable validity in predicting who will resign and who will be promoted 
at a later date (1). When different types of criteria are compared for the 
same individuals, moreover, their intercorrelations are often quite low. For 
instance, accident records or absenteeism may show virtually no relation to 
productivity or error data (41). It is abundantly clear that: (1) job criteria 
are complex; (2) various types of criterion data, or subcriteria, for any one job 
may have little relation to each other; and (3) different jobs bearing the same 
name often represent a different combination of subcriteria. 

It is largely for these reasons that industrial psychologists find it necessary 
to develop custom-made tests or selection programs within each company. 
Although clearly desirable, however, this practice is often inapplicable in small 
companies or in jobs having few employees. The latter would be true, for 
instance, for certain executive and other high-level jobs. Even when adequate 
facilities and a sufficient number of validation cases are available, the valida- 
tion of each predictor against a composite criterion of job success may be of 
questionable value. If different subcriteria are relatively independent, then a 
more effective procedure would be to validate each predictor against that 
aspect of the criterion it was designed to measure. 

The concept of synthetic validity was introduced by Lawshe (32, 33) as 
one possible approach to these problems (see also 3). In finding synthetic 
validity, each predictor is validated, not against a composite criterion, but 
against job elements identified through job analyses. The validity of any 
predictor or battery of predictors is then computed synthetically for a given 
job on the basis of the job elements entering into that job and their relative 
weights. Thus, if a test has a high validity in predicting performance in 
delicate manipulative tasks, and if such tasks loom large in a particular job, 
then the test will have high synthetic validity for that job. A statistical 
technique known as the “J coefficient” (for “job coefficient”) has been de- 
veloped for estimating the synthetic validity of a test from a knowledge of 
the weights of job elements in the job and in the test (3, 40). This technique 
offers a promising tool for generalizing validity data from one job to another 
without actually conducting a separate validation study for each job. How 
effective the technique will eventually prove in practice can be determined 
only by further research. 

Subgroup Analysis and Moderator Variables. Validity may also vary 
among subgroups differing in personal characteristics. Thus a test may be a 
better predictor of criterion performance for men than for women or for 
applicants from a lower than for applicants from a higher socioeconomic level. 
In such examples, sex and socioeconomic level are known as "moderator 
variables," since they modify the predictive validity of the test. Even when a 
test is equally valid for all subgroups, the same score may have a different 
predictive significance when obtained by members of different subgroups. For 
example, if a student who has had poor schooling obtains the same score on 
a college aptitude test as one who has had excellent schooling, the former is 
likely to earn better grades than the latter if they are both admitted to the 
same college. In such a situation, the same test score would lead to a different 
prediction for persons in different subgroups. 

Interests and motivation often function as moderator variables in individual 
cases. Thus, if an applicant has little interest in a job, he will probably do 
poorly regardless of his scores on relevant aptitude tests. Among such persons, 
the correlation between aptitude test scores and job performance would be 
low. For individuals who are interested and highly motivated, on the other 
hand, the correlation between aptitude test score and job success may be quite 
high. 

A moderator variable may itself be a test score, in terms of which individuals 
are sorted into subgroups. There have been some promising attempts to 
identify such moderator variables in test scores (4, 21, 23, 24). In a study of 
taxi drivers, for example, the correlation between an aptitude test and a 
criterion of job performance in a total applicant sample was only .220 (23). 
The group was then sorted into thirds, on the basis of scores on an occupa- 
tional interest test. When the validity of the aptitude test was recomputed 
within the third whose occupational interest level was most appropriate for 
the job, it rose to .664. Such findings suggest that one test might first be used 
to screen out individuals for whom the second test is likely to have low 
validity; then, from among the remaining cases, those scoring high on the 
second test are selected. In many ways, the analysis of validity data for sub- 
groups of the validation population may enhance the usefulness of available 
predictors. 
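
The subgroup procedure described above (sort the cases into thirds on the moderator, then recompute validity within each third) can be sketched as follows. The data are simulated, not those of the taxi-driver study: the assumption built into the simulation is that aptitude affects the criterion in proportion to interest, and all names are hypothetical.

```python
import random

def pearson(x, y):
    """Pearson product-moment correlation between two equal-length lists."""
    n = len(x)
    mean_x, mean_y = sum(x) / n, sum(y) / n
    sxy = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    sxx = sum((a - mean_x) ** 2 for a in x)
    syy = sum((b - mean_y) ** 2 for b in y)
    return sxy / (sxx * syy) ** 0.5

def validity_by_thirds(predictor, criterion, moderator):
    """Sort cases into thirds on the moderator; return the validity in each third."""
    order = sorted(range(len(moderator)), key=lambda i: moderator[i])
    size = len(order) // 3
    thirds = [order[:size], order[size:2 * size], order[2 * size:]]
    return [
        pearson([predictor[i] for i in group], [criterion[i] for i in group])
        for group in thirds
    ]

rng = random.Random(1)  # fixed seed so the run is repeatable
n = 300
interest = [rng.random() for _ in range(n)]        # moderator, 0 to 1
aptitude = [rng.gauss(0, 1) for _ in range(n)]     # predictor
# Assumed model: aptitude counts for more the higher the interest score.
criterion = [i * a + rng.gauss(0, 0.5) for i, a in zip(interest, aptitude)]

r_total = pearson(aptitude, criterion)
r_low, r_mid, r_high = validity_by_thirds(aptitude, criterion, interest)
print("total sample:", round(r_total, 2))
print("low, middle, high interest thirds:",
      round(r_low, 2), round(r_mid, 2), round(r_high, 2))
```

Under this assumed model the validity of the aptitude test is highest in the third with the most appropriate interest scores and lowest in the third with the least appropriate ones.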


FORMULATING A STRATEGY 
FOR PERSONNEL DECISIONS 


Having determined the validity of various predictors for a given job criterion 
and having on that basis chosen the predictors to be retained for operational 
purposes, the personnel psychologist must now formulate an appropriate 
strategy for using these predictors in making personnel decisions. In the ter- 
minology of decision theory, a strategy is a technique for utilizing information 
in order to reach a decision about the individual (13, ch. 3). The simplest 
kind of strategy involves the use of a single predictor. If circumstances of 
supply and demand determine the selection ratio, then all that is required is 
to start selecting individuals from the top of the predictor distribution and 
continue downward until the required number of persons has been hired. If 
only persons likely to fall above a minimum criterion level are acceptable, 
then a cutoff point in predictor scores can be established by examining the 
bivariate distribution of criterion and predictor scores. How high a test score 
should applicants have, for example, to ensure that 90 per cent of those hired 
will reach or exceed a specified standard of job performance? 
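
The cutoff question just posed can be answered directly from a table of past cases. The sketch below uses invented (predictor, criterion) pairs and a hypothetical function name; it returns the lowest predictor score at which at least 90 per cent of the persons scoring at or above it met the performance standard.

```python
def lowest_cutoff(pairs, standard, target=0.90):
    """Lowest predictor cutoff whose selected group meets the target success rate.

    pairs    -- (predictor score, criterion score) for previously tested employees
    standard -- minimum acceptable criterion score
    target   -- required proportion of selected persons reaching the standard
    """
    for cutoff in sorted({predictor for predictor, _ in pairs}):
        selected = [crit for predictor, crit in pairs if predictor >= cutoff]
        successes = sum(1 for crit in selected if crit >= standard)
        if successes / len(selected) >= target:
            return cutoff
    return None  # no cutoff in the observed range reaches the target

# Invented validation data: (test score, job-performance score).
validation_cases = [
    (10, 40), (12, 55), (14, 45), (15, 60), (16, 62), (17, 58),
    (18, 70), (19, 65), (20, 72), (21, 68), (22, 75), (23, 80),
]
cutoff = lowest_cutoff(validation_cases, standard=60)
print("lowest acceptable test score:", cutoff)
```

With so few cases the result is unstable, of course; in practice the bivariate distribution would be based on a much larger validation sample.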

Most selection programs, however, are not so simple as this. Since criteria 
are complex, several predictors are usually required for adequate criterion 
coverage. With multiple predictors, the question arises as to how they are to 
be combined in reaching a decision about the individual. Given five scores 
earned by an individual on tests or other selection devices, how are they to be 
used in deciding whether to accept or reject the applicant? 

Clinical Judgment. When tests or other predictors are employed in the 
intensive study of individual cases, as in selection of high-level personnel, 
executive evaluation, clinical diagnosis, or counseling, it is a common practice 
for scores to be interpreted by the examiner in arriving at a decision without 
further statistical manipulation (see, e.g., 48). To be sure, the individual's 
scores are evaluated with reference to any available general or local norms. But 
no statistical formula or other automatic procedure is applied in combining 
scores from different predictors or in interpreting the individual’s score pattern. 
Through a relatively subjective process, the examiner interprets the individual’s 
scores in terms of his own past experience with similar cases, his familiarity 
with particular job requirements, or his knowledge of psychological theory 
and relevant published research. The results are usually presented in the form 
of a detailed description of personality dynamics and aptitudes, supplemented 
by specific predictions such as “Mr. Greenwald will make a good executive 
vice-president for this company” or “Miss Adams will not respond well to 
psychotherapy.” 

Research has been under way to try to discover just what the clinician or 
interviewer does when evaluating and combining data to reach a decision (15, 
27, 28, 34, 35, 43, 48). Some investigators have also tried to compare the 
relative effectiveness of clinical and statistical methods of combining data in 
situations where both are applicable (see 35). But, in practice, both are 
rarely applicable to the same situation. We may lack either the extensive 
quantitative data needed to develop statistical prediction formulas or the time 
and personnel needed for clinical prediction. 

Multiple Cutoff Scores. A simple statistical strategy for combining scores 
utilizes multiple cutoff points. This procedure requires the establishment of 
a minimum score on each predictor. Every applicant who falls below the 
minimum score on any one of the predictors is rejected; only those reaching or 
exceeding the cutoff scores on all predictors are accepted. This is the technique 
followed, for example, in the General Aptitude Test Battery (GATB) de- 
veloped by the United States Employment Service for use in the occupational 
counseling program of its State Employment Service offices (17, 25). This 
battery yields nine aptitude scores, only some of which are considered for any 
one occupation. For example, accounting was found to require a minimum score 
of 105 in intelligence and 115 in numerical aptitude; plumbing called for a 
minimum of 85 in intelligence and 80 in numerical aptitude, in spatial apti- 
tude, and in manual dexterity. These are standard scores on a scale having a 
mean of 100 and an SD of 20. 
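
The minimum scores quoted above for accounting and plumbing can be applied mechanically. In the sketch below only those minimums come from the text; the aptitude labels, the function name, and the applicant's scores are invented for illustration.

```python
# Minimum standard scores quoted in the text (scale: mean 100, SD 20).
# The key names are illustrative labels, not official GATB designations.
CUTOFFS = {
    "accounting": {"intelligence": 105, "numerical": 115},
    "plumbing": {
        "intelligence": 85,
        "numerical": 80,
        "spatial": 80,
        "manual_dexterity": 80,
    },
}

def passes_cutoffs(scores, minimums):
    """Accept only if every required aptitude reaches or exceeds its minimum.

    A missing score is treated as a failure on that aptitude.
    """
    return all(
        scores.get(aptitude, float("-inf")) >= minimum
        for aptitude, minimum in minimums.items()
    )

# Hypothetical applicant.
applicant = {
    "intelligence": 120,
    "numerical": 110,
    "spatial": 95,
    "manual_dexterity": 90,
}

for occupation, minimums in CUTOFFS.items():
    print(occupation, passes_cutoffs(applicant, minimums))
```

Note that this applicant's high intelligence score does not offset a numerical score that falls 5 points short of the accounting minimum; under a multiple cutoff strategy no such compensation is possible.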

The multiple cutoff strategy is readily applicable to predictors that are not 
linearly related to the criterion. Up to a certain point, for example, speed of 
hand movement may be closely related to output in an assembly-line job. But, 
beyond that point, greater speed may be of no avail because of the mechanical 
limitations of the operation. The relevance of speed to performance in such a 
job would be reflected in the failure of persons below a certain speed. Yet the 
correlation between speed and performance level among employed workers may 
be low and negligible. The multiple cutoff technique is also especially suit- 
able for jobs in which superior ability along one line cannot compensate for 
a deficiency in an essential skill. Operators of sound-detection devices, for 
example, need good auditory discrimination. Anyone incapable of making the 
necessary discriminations will fail in such a job, regardless of how well quali- 
fied he may be in other respects. 

Multiple Regression Equation. The most precise strategy for combining 
predictor scores is that using the multiple regression equation. This equation 
yields a predicted criterion score for each individual. Specific techniques for 
computing regression equations can be found in many texts on psychological 
statistics (e.g., 26). In such an equation, each predictor is weighted in direct 
proportion to its criterion correlation and in inverse proportion to its correla- 
tion with other predictors in the battery. Thus a test that overlaps other tests 
in the battery receives less weight than one that measures a unique aspect 
of the criterion, even if their validity coefficients are equally high. The weights 
employed are also such as to translate all predictor scores into the same units, 
so that they may be added. Finally, a constant is added to the total so as to 
express the predicted criterion score in terms of some convenient scale, usually 
the same scale employed in reporting the original criterion scores. 

In the process of computing a regression equation, it is also possible to 
select from the original set of predictors those that will yield the highest 
multiple correlation (R) with the criterion. The multiple correlation itself 
should be corrected for “shrinkage.” When the predictor battery is applied to 
a new sample, the multiple correlation will drop somewhat, because the 
weights derived in the first sample are optimum for that sample only. Because 
of chance errors in the original correlation coefficients, the weights will vary 
somewhat from sample to sample. The amount of shrinkage in R can be esti- 
mated by applying a shrinkage formula, or it can be determined empirically 
by cross-validating the battery. 
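
The text does not say which shrinkage formula is meant. One widely used correction, assumed here purely for illustration, is the familiar adjusted form R'^2 = 1 - (1 - R^2)(N - 1)/(N - k - 1), where N is the number of cases and k the number of predictors. The function name and the example figures below are hypothetical.

```python
def shrunken_r(r, n_cases, n_predictors):
    """Estimate the multiple correlation after shrinkage.

    Uses the adjusted-R-squared correction (an assumed choice of formula):
        R'^2 = 1 - (1 - R^2) * (N - 1) / (N - k - 1)
    A negative estimate is reported as zero.
    """
    degrees = n_cases - n_predictors - 1
    if degrees <= 0:
        raise ValueError("need more cases than predictors plus one")
    r2_adjusted = 1 - (1 - r ** 2) * (n_cases - 1) / degrees
    return max(r2_adjusted, 0.0) ** 0.5

# Hypothetical battery: R = .60 from 4 predictors.
for n in (30, 50, 200):
    print(n, "cases:", round(shrunken_r(0.60, n, 4), 3))
```

The estimated shrinkage is larger when the sample is small relative to the number of predictors, which is why cross validation matters most for batteries developed on few cases.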

The use of a multiple regression equation in predicting an applicant’s job 
performance can be illustrated with the following equation: 


X'_c = .23X_1 + .32X_2 + .18X_3 + .39X_4 + 46 


Let us suppose that an applicant, Henry Barclay, earns the following scores 
on four predictors, in the order listed in the above equation: 15, 8, 11, and 
13. His predicted criterion score (X'_c) is then computed by substituting these 
values in the equation, as follows: 


X'_c = (.23)(15) + (.32)(8) + (.18)(11) + (.39)(13) + 46 = 59.06 


Rounding the total to two significant figures, we would report Mr. Barclay's 
estimated criterion score, or index of expected job performance, as 59. This 
score can then be evaluated either in reference to the scores of other appli- 
cants or in reference to an empirically established cutoff score. 
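
The same computation can be written as a short routine. The weights, constant, and scores are those of the example above; the function and variable names are merely illustrative.

```python
# Regression weights and constant from the example equation in the text.
WEIGHTS = (0.23, 0.32, 0.18, 0.39)
CONSTANT = 46

def predicted_criterion(scores, weights=WEIGHTS, constant=CONSTANT):
    """Weighted sum of predictor scores plus the additive constant."""
    if len(scores) != len(weights):
        raise ValueError("one score is required for each predictor")
    return sum(w * x for w, x in zip(weights, scores)) + constant

barclay = (15, 8, 11, 13)
estimate = predicted_criterion(barclay)
print(round(estimate, 2))  # 59.06, as in the worked example
print(round(estimate))     # 59, the value reported after rounding
```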

It can be seen that, unlike the multiple cutoff strategy, the multiple regres- 
sion equation makes it possible for high scores in some predictors to com- 
pensate for low scores in others. For this reason, this technique should not 
be used when deficiencies in key skills ought to disqualify an applicant. With 
such jobs, a combination of multiple cutoff and regression equation may be 
employed, whereby applicants are first screened on key skills and then the 
regression equation is applied to the remaining cases. An advantage of the 
regression equation is that it provides an index of each individual's predicted 
job performance, permitting a comparison of the relative qualifications of 
applicants. With multiple cutoff procedures, individuals are simply classified 
as accepted or rejected. 
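
The combined strategy mentioned above, in which applicants are first screened on key skills and then ranked by the regression equation, might be sketched as follows. The regression weights are those of the earlier example; the choice of the first predictor as the key skill, its minimum of 10, and the two additional applicants are all hypothetical.

```python
WEIGHTS = (0.23, 0.32, 0.18, 0.39)   # weights from the example equation
CONSTANT = 46
KEY_SKILL_MINIMUMS = {0: 10}         # hypothetical: first predictor must be >= 10

def two_stage_selection(applicants):
    """Screen on key-skill cutoffs, then rank survivors by predicted criterion."""
    survivors = {
        name: scores
        for name, scores in applicants.items()
        if all(scores[i] >= minimum for i, minimum in KEY_SKILL_MINIMUMS.items())
    }
    predicted = {
        name: sum(w * x for w, x in zip(WEIGHTS, scores)) + CONSTANT
        for name, scores in survivors.items()
    }
    # Highest predicted criterion score first.
    return sorted(predicted.items(), key=lambda item: item[1], reverse=True)

applicants = {
    "Barclay": (15, 8, 11, 13),
    "Applicant B": (6, 20, 20, 20),   # strong elsewhere, deficient in the key skill
    "Applicant C": (12, 10, 9, 10),
}

ranking = two_stage_selection(applicants)
for name, score in ranking:
    print(name, round(score, 2))
```

Applicant B would have received the highest predicted score from the regression equation alone, but is removed at the first stage because no amount of compensation is allowed for the key skill.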

Classification Decisions. So far we have considered only selection deci- 
sions, in which each individual is either accepted or rejected. Deciding 
whether or not to hire a job applicant, admit a student to medical school, or 
accept a candidate for officer training are examples of selection. In classifica- 
tion decisions, on the other hand, every individual must be utilized. The 
question is where to assign each person so as to maximize the effectiveness 
of the total organization. Classification always involves more than one cri- 
terion—be it jobs, military specialties, courses of study, or clinical treatments. 

In the military services, classification is a major problem. After preliminary 
screening, everyone in the remaining manpower pool must be assigned to the 
military specialty where he can serve most effectively. An example of a classi- 
fication battery is that developed by the Air Force during World War II for 
assigning men to training programs for pilot, navigator, bombardier, or other 
aircrew specialties. Similar batteries are now used by all branches of the 
armed services. Another example of classification decisions is provided by the 
choice of a field of concentration by a college student. Vocational counseling, 
too, is based essentially on classification, since the client is told his chances 
of succeeding in different kinds of work. Clinical diagnosis is likewise a 
classification problem, the major purpose of each diagnosis being a decision 
regarding the appropriate type of therapy. 

In industry, classification decisions are required when new employees are 
assigned to different training programs or different jobs within the company. 
With rising employment and with a gradual shift from the limited goal of 
filling a particular job to the broader goal of utilizing every individual in the 
work he is best qualified to perform, classification assumes increasing im- 
portance. 

For classification purposes, predictors must have differential validity. Such 
validity is higher, the larger the difference between the correlations of the 
predictor with the separate criteria to be predicted. In a two-criterion classifi- 
cation problem, for example, the ideal test would have a high correlation 
with one criterion and a zero correlation (or preferably a negative correlation) 
with the other criterion. General intelligence tests are of little use for classi- 
fication decisions, since they predict success about equally well in most fields. 
Hence their correlations with the criteria to be differentiated would be too 
similar. An individual scoring high on such a test would be classified as suc- 
cessful for either assignment, and it would be impossible to predict in which 
he would do better. In a classification battery, we need some tests that are 
good predictors of criterion A and poor predictors of criterion B and other 
tests that are poor predictors of A and good predictors of B. Statistical pro- 
cedures have been developed for selecting predictors so as to maximize the 
differential validity of a classification battery (8, 13, 29, 36, 46). When the 
number of criteria is greater than two, such procedures become quite com- 
plex. 


THE ILLUSION OF SUBJECTIVE VALIDATION 


A favorite sales appeal of charlatans and borderline "consultants" is to invite 
the business executive to undergo their particular brand of personality analysis 
so that he may "judge it for himself." The sort of personality description that 
they provide consists of vague generalities applicable to most people. When 
confronted with such a description of himself, the recipient is impressed with 
its apparent accuracy and insight and concludes that the method will work 
equally well with his employees. This illusory validation has been called the 
“Barnum effect” (see 5, p. 47; 16, p. 223), after Phineas T. Barnum, the 
famous showman who is credited with the remark that there's a fool born 
every minute. 

The readiness with which these general personality descriptions are ac- 
cepted as true was demonstrated at a conference of personnel managers (42). 
A standard personality inventory was administered to the group, and the 
papers were then collected for scoring. Before getting the actual scores, each 
participant received an identical copy of a fake "personality analysis." The 
analysis consisted of a list of descriptive statements, in which the same 13 
had been marked for each person. These 13 statements had been chosen 
because they were assertions applicable to the large majority of people; most 
were flattering, thus increasing their acceptability. Examples include: 


You have a great need for other people to like and admire you. 

You have a tendency to be critical of yourself. 

You have a great deal of unused capacity which you have not turned to your 
advantage. 

You pride yourself as an independent thinker and do not accept others' statements 
without satisfactory proof. 


The unmarked statements, interspersed among the others, were generally un- 
favorable comments about the personality being tested. 

After examining his own report, each man was asked to rate the accuracy 
of the description on the following five-point scale: amazingly accurate, rather 
good, about half and half, more wrong than right, almost entirely wrong. Of 
the 68 personnel managers in the group, 50 per cent marked the description 
"amazingly accurate,” 40 per cent "rather good," and 10 per cent “about half 
and half." When given the genuine test scores and told about the hoax, the 
personnel managers were quite vocal in their surprise. Apparently none had 
doubted the authenticity of the original reports. 

Similar results have been obtained with other groups, including college 
students and industrial managers (see 42, p. 350). Not only were the per- 
sonality descriptions as a whole accepted as accurate by the large majority 
of all these groups, but most of the individual statements marked for each 
person were also classified as predominantly correct. It will be recognized 
that this Barnum effect also accounts for much of the success of fortune 
tellers, astrologers, and similar charlatans. All these systems provide only 
random assortments of universally applicable generalities and fail to indicate 
how one person actually differs from another. And it is this differentiation of 
individuals that is the primary function of personnel selection and evaluation. 


SUMMARY 


Developing a personnel selection program involves four major steps: (1) job 
analysis, (2) preparation and preliminary tryout of predictors, (3) validation 
of predictors, and (4) formulating a strategy for personnel decisions. Job 
analysis defines the problem by specifying worker qualifications for which 
predictors are needed. Job descriptions should be specific to the job and 
should concentrate on critical worker requirements. 

The predictors assembled or prepared on the basis of the job analysis should 
be checked for such characteristics as dispersion, difficulty level, reliability, 
and specificity before they are administered to the validation sample. Pre- 
dictive or concurrent validity is found by comparing predictor scores with 
criterion scores. Criterion contamination should be avoided. Different cri- 
terion indices may be combined in terms of weighted standard scores or by 
the dollar-criterion technique. Among the techniques for expressing the rela- 
tionship between predictor and criterion are significance of mean differences, 
expectancy tables, and correlation coefficients. Cross validation in a new sample 
is advisable, especially when only a small proportion of the original predictors 
survive initial validation. Synthetic validity provides for the validation of 
predictors against separate job elements, whence their validity for a total job 
can be estimated. 

When predictors are used in making personnel decisions, they may be 
combined through clinical judgment, multiple cutoff scores, or regression 
equations. For classification decisions, predictors need differential validity 
with regard to the separate criteria. An illusory validation based on subjective, 
anecdotal observations has been described as the Barnum effect. 


Psychological Tests 


in Personnel Selection 


In developing a personnel selection program, the industrial psychologist gener- 
ally includes tests among the predictors. Over the past three decades, per- 
sonnel selection in general has come to rely more and more upon tests (76). 
As might be expected, however, tests are more likely to be employed by the 
larger companies. In a survey conducted in 1954, the use of employment 
tests was reported by 53 per cent of the companies with 5,000 or more em- 
ployees, in contrast to only 15 per cent of those with fewer than 250 em- 
ployees (67). It should be noted that there is hardly a job for which just 
one test is an adequate predictor. This is understandable in view of the com- 
plexity of job criteria, discussed in the preceding chapter. The fact that validity 
coefficients of single tests are rarely larger than .40 and often fall below this 
value suggests that at best each test measures only some of the prerequisites 
for any particular job. For this reason, the common practice is to use a combi- 
nation of tests. Moreover, test scores should be supplemented with informa- 
tion derived from other sources, such as application forms, recommendations, 
and interviews, to be discussed in Chapter 4. 

In the present chapter, the major kinds of tests used in personnel selection 
will be examined and illustrated with a few examples. In accordance with 
common usage, these instruments have been grouped into five categories: 
intelligence tests, special aptitude tests, multiple aptitude batteries, achieve- 
ment tests, and personality tests. 

A more comprehensive coverage of available tests can be found in text- 
books on psychological testing, such as Anastasi (3) and Cronbach (21). 
Some books on industrial psychology devote several chapters to the use of 
tests in industry (see, e.g., 93, chs. 5-7). Of particular interest are surveys of 
tests validated against industrial criteria. A handbook by Dorcus and Jones 
(23), published in 1950, summarizes information about each test for which 
validation data against an industrial criterion are available. Ghiselli (35) pre- 
pared a comprehensive survey of validity coefficients of various types of tests 
against occupational criteria, although validities are not given separately for 
individual tests. Patterson (66) reports validities of specific tests in predicting 
performance in trade and vocational courses. 

Current information on tests can be obtained from several psychological 
journals, such as Psychological Abstracts, Journal of Applied Psychology, and 
Educational and Psychological Measurement; the last-named contains a special 
“Validity Studies Section.” Similarly, Personnel Psychology includes a section 
entitled “Validity Information Exchange” which summarizes new data on 
the validity of specific tests. A most important source of information about 
tests is the series of Mental Measurements Yearbooks, edited by Buros (18). 
Published every few years, these yearbooks cover nearly all commercially 
available tests developed in English-speaking countries. Besides routine infor- 
mation about each test, they contain one or more critical reviews of most 
tests, prepared by test experts in the appropriate area. Direct information 
about specific tests can also be found in the catalogues of test publishers and 
in the manual that accompanies each test. The names and addresses of some 
of the larger American publishers and distributors of tests are listed in Ap- 
pendix E. 


INTELLIGENCE TESTS 


Nature of Intelligence Tests. Intelligence tests were first developed in 
the attempt to measure the individual's general intellectual level. It was hoped 
that the score on such a test would reflect a sort of over-all average of what 
the individual could do with different intellectual tasks. The tasks were 
chosen so as to sample a wide variety of abilities. In actual practice, however, 
intelligence tests do not sample all abilities equally. They are overweighted 
with some abilities and may omit or scarcely touch upon others. Most intelli- 
gence tests measure chiefly verbal comprehension, a fact that is illustrated by 
the very high correlations found between total scores on intelligence tests 
and scores on vocabulary tests. Arithmetic reasoning and numerical compu- 
tation are also frequently included. This combination of abilities will be 
recognized as being of prime importance in schoolwork at all levels. In fact, 
intelligence tests have frequently been validated against academic achieve- 
ment as a criterion. Many intelligence tests correlate nearly as highly with 
tests of school achievement as they do with each other. For these reasons 
intelligence tests are often more accurately designated as tests of “scholastic 
aptitude.” 

It should also be borne in mind that different intelligence tests may cover 
a somewhat different combination of abilities. Thus performance and non- 
language tests often stress spatial and perceptual rather than verbal and nu- 
merical aptitudes. And as might be anticipated, their correlations with school 
achievement are somewhat lower. Partly for this reason and partly for other 
reasons pertaining to the construction of the tests, the same score may have 
quite different meanings when obtained on different intelligence tests. The 
"IQ" is not a property of the organism, but a score on a particular test. It can- 
not be interpreted without knowledge about the test from which it was 
derived. 


The IQ, it might be added, is not a very satisfactory way of expressing 
scores, especially for adults. Other types of scores, such as standard scores or 
percentiles, are technically preferable and more widely applicable. Standard 
scores, it will be recalled, express the individual's position as so many standard 
deviations above or below the mean of the normative group (see Chapter 2). 
The so-called "deviation IQ," now used with several tests, is actually a stand- 
ard score with a mean of 100 and an SD equal to 15 or to some other speci- 
fied value close to it. For instance, a person with a deviation IQ of 115 falls 
1 SD above the mean. Percentile scores report the percentage of persons in 
the normative group who scored below the individual. Thus, if an examinee's 
score is at the 64th percentile, his performance excelled 64 per cent of the 
normative sample. 
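
The three kinds of scores just described are simple to compute. In the sketch below the deviation IQ uses the mean of 100 and SD of 15 mentioned in the text; the raw scores, the normative mean and SD, the normative sample, and the function names are invented for illustration.

```python
def standard_score(raw, norm_mean, norm_sd):
    """Position in SD units above (+) or below (-) the normative mean."""
    return (raw - norm_mean) / norm_sd

def deviation_iq(z, mean=100, sd=15):
    """Re-express a standard score on a scale with the given mean and SD."""
    return mean + sd * z

def percentile_rank(raw, norm_scores):
    """Percentage of the normative group scoring below the given raw score."""
    below = sum(1 for score in norm_scores if score < raw)
    return 100 * below / len(norm_scores)

# Hypothetical test with a normative mean of 50 and SD of 10.
z = standard_score(raw=60, norm_mean=50, norm_sd=10)
print(z, deviation_iq(z))                # 1.0 115.0

# Hypothetical normative sample of 100 persons with raw scores 1 to 100.
norm_sample = list(range(1, 101))
print(percentile_rank(65, norm_sample))  # 64.0
```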

In interpreting intelligence test scores, we should also bear in mind that 
such scores are influenced by the individual's past experience. Hence they 
should not be evaluated in the absence of background information about the 
individual. The same score obtained by persons varying widely in education, 
socioeconomic level, or cultural milieu may have different connotations in 
the prediction of subsequent performance in training, on a job, or in other 
criterion situations. 

Since intelligence tests are often used in industry for a rough, preliminary 
screening of applicants, several tests have been specially developed for this 
purpose. These tests are short, usually requiring between 10 and 30 minutes; 
they are easy to administer; and they can be objectively scored by a clerk or 
by machine. It should be recognized that, while such tests may have fair 
validity for some jobs, they may have little or no validity for others. The 
aptitudes sampled by intelligence tests are undoubtedly far more relevant 
to some types of jobs than to others. Jobs cannot simply be put into a hier- 
archy in terms of the amount of "intelligence" they require, because the type 
of "intelligence" needed for different jobs varies. For many occupations, espe- 
cially those requiring mechanical skills, tests of special aptitudes will serve as 
better predictors of achievement than will the so-called general intelligence 
tests. 

An early test that has been widely employed in personnel screening is the 
Otis Self-Administering Test of Mental Ability (62), which later served as 
the basis for developing the highest level of the Otis Quick-Scoring Mental 
Ability Tests (63). The earlier test has been used in screening applicants for 
such varied jobs as those of clerks, calculating-machine operators, assembly- 
line workers, and foremen and other supervisory personnel. Dorcus and Jones 
(23) cite 36 validation studies in which the Otis test was checked against an 
industrial criterion. Not all these studies yielded significant validity coeffi- 
cients, of course, but many of them did. In semiskilled jobs, for example, the 
Otis test correlated moderately well with success in learning the job and ease 
of initial adaptation, but not with subsequent job achievement (85). This 
would be expected for jobs that are largely routine, once they are learned. 
Among high-level professional personnel, who represent a select group in 
terms of academic achievement, correlations between Otis score and criteria 
of job success are usually negligible, since this test does not discriminate 
adequately at the upper levels. 

The Wonderlic Personnel Test (103) is an adaptation and abridgment of 
the Otis Self-Administering Test. Despite its time limit of only 12 minutes, 
it yields correlations of .81 to .87 with the original, longer Otis test. Correla- 
tions with industrial criteria vary widely with the nature of the job, the sam- 
ple tested, and the type of criterion measure employed (23). The test appears 
to have its highest validity in the selection of clerical workers. Another ex- 
ample is the Adaptability Test (92). In approximately 20 minutes, this test 
yields a single score based on a wide variety of verbal, numerical, and spatial 
items. It has been used in many kinds of industrial and office jobs, with fair 
evidence of validity. 

A more recently developed test is the Wesman Personnel Classification 
Test (98). Like most current intelligence tests, it yields separate Verbal and 
Numerical scores, as well as a Total score. The Verbal score is derived from 
an 18-minute verbal analogies test, in which each item contains two blanks, 
as illustrated in Figure 4. The Numerical score is based on a 10-minute arith- 
metic computation test whose items were designed so as to put a premium on 
ingenuity and ability to perceive numerical relations. Correlations of Total 
scores with criteria of job success, usually consisting of supervisors’ ratings, 
range from .29 to .62. From the nature of the items, as well as from the dis- 
tribution of scores reported for various educational and occupational groups, 
this test appears to be better suited for higher-level than for lower-level per- 
sonnel. 

Another rapid screening test is the Oral Directions Test (55), a 15-minute 
test in which both directions and test items are presented on sound recordings. 
This procedure ensures more uniformity than would be possible through 
oral administration by different examiners. The test is rather heavily weighted 
with perceptual and spatial items and also depends to a considerable extent 
upon immediate memory for auditory instructions. Available data suggest that 
it discriminates somewhat better at the lower intellectual levels and may be 
particularly useful in screening applicants for such jobs as general laborer, 
maintenance and service worker, and messenger. The Oral Directions Test 
is part of a short battery, called Personnel Tests for Industry, which also in- 
cludes a 5-minute Verbal Test (99) and a 20-minute Numerical Test (22). 

Mention may also be made of nonlanguage tests, designed for use with 
illiterates, the foreign-speaking, or persons with widely different cultural back- 
grounds (see 3, ch. 10). Some of these tests can be administered without 
either spoken or written language, the instructions being conveyed by gesture, 
pantomime, and demonstration. Others require simple oral instructions that 
can be readily translated into different languages. Figure 5 shows two sample 
items from such a nonlanguage test, the Progressive Matrices (69). Each of 
the 60 items in this test consists of a matrix, or design, from which a piece 
has been removed. The subject must choose the correct insert from the given 
options. The earlier items in the series require primarily accuracy of 
discrimination, as in the first item in Figure 5. Later, more difficult items involve 
analogies, permutation and alternation of pattern, and other logical relations, 
as illustrated in the second item in Figure 5. 

Fig. 4 Sample verbal item from Wesman Personnel Classification Test. 
(Reproduced by permission. Copyright 1946, The Psychological Corporation.) 

Example 3. ...... is to night as breakfast is to ...... 

1. flow    2. gentle    3. supper 
A. include    B. morning    C. enjoy    D. corner 

Answer: "Supper is to night as breakfast is to morning," so the examinee 
would have written 3B on the line at the right. 

Fig. 5 Sample items from Progressive Matrices. (Reproduced by permission 
of J. C. Raven.) 

Some Industrial Applications. Since intelligence tests (especially those 
of the more common, verbal type) are largely measures of scholastic aptitudes, 
we should expect them also to predict performance in occupational training. 
Most investigations do in fact support this expectation, although the correla- 
tions with industrial training criteria are somewhat lower than those with 
academic criteria. In the extensive survey by Ghiselli (35, p. 134), the validity 
coefficients of intelligence tests with industrial training criteria averaged about 
.35, in contrast with the correlations of .50 or .60 commonly found with 
academic achievement. 

On the other hand, intelligence tests generally predict training performance 
somewhat better than they predict subsequent job performance. This un- 
doubtedly results in part from the greater similarity of training criteria to the 
educational criteria which intelligence tests were originally designed to pre- 
dict. Another reason is that samples of employed workers tend to be more 
homogeneous than training groups, because of the sequential dropping out of 
unsuccessful employees. Available measures of training criteria are also likely 
to be more reliable than measures of job criteria. Similarly, personality char- 
acteristics and fortuitous circumstances more often affect job performance 
than performance in training. All these factors would tend to make for higher 
correlations with training than with job criteria. 
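
The effect of sample homogeneity mentioned above can be illustrated with a small simulation. The sketch below is an assumption-laden toy model, not a reanalysis of any study: it draws hypothetical test and performance scores with a correlation of about .50, then recomputes the correlation after dropping the less successful half of the group, as happens when unsuccessful employees leave.

```python
import random

def pearson(xs, ys):
    """Product-moment correlation between two equally long lists."""
    n = len(xs)
    mx, my = sum(xs) / n, sum(ys) / n
    sxy = sum((x - mx) * (y - my) for x, y in zip(xs, ys))
    sxx = sum((x - mx) ** 2 for x in xs)
    syy = sum((y - my) ** 2 for y in ys)
    return sxy / (sxx * syy) ** 0.5

def simulate(n=5000, true_r=0.5, seed=0):
    """Return (r in the full group, r among the more successful half)."""
    rng = random.Random(seed)
    tests, perf = [], []
    for _ in range(n):
        t = rng.gauss(0, 1)                                   # test score
        p = true_r * t + (1 - true_r ** 2) ** 0.5 * rng.gauss(0, 1)
        tests.append(t)
        perf.append(p)                                        # performance
    r_full = pearson(tests, perf)
    cutoff = sorted(perf)[n // 2]                             # median performance
    kept = [(t, p) for t, p in zip(tests, perf) if p > cutoff]
    r_kept = pearson([t for t, _ in kept], [p for _, p in kept])
    return r_full, r_kept

r_full, r_kept = simulate()
print(round(r_full, 2), round(r_kept, 2))   # the second value is the smaller
```

The median cutoff and the sample size are arbitrary choices. Any selection that narrows the range of performance would be expected to lower the observed correlation in the same way.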

When intelligence tests are correlated with criteria of job proficiency among 
employed personnel, the correlations vary widely with the nature of the job. 
In jobs requiring a high degree of the abstract verbal and numerical aptitudes 
covered by these tests, the correlation is likely to be high. In other jobs, how- 
ever, the correlation may be low or negligible, once the required minimum is 
exceeded. Thus in certain jobs the relation between job proficiency and in- 
telligence may be nonlinear. This might be the case, for instance, when job 
success depends chiefly upon traits not measured by intelligence tests, such as 
mechanical aptitude, artistic sensitivity, manual skills, or personality factors. 
Even in such jobs, of course, persons falling below a certain minimum in 
academic intelligence would fail. 

Another example of nonlinear relation is provided by routine and monotonous 
jobs, for which individuals may have too much as well as too little 
intelligence. Maximum proficiency in such jobs may be achieved by persons 
falling within an optimal range. This would also be true, of course, of any 
other relevant job skills, besides those measured by intelligence tests (14). 
The worker who is overqualified for such a routine job is likely to become 
bored, disinterested, inattentive, and dissatisfied, and these attitudes may decrease 
job proficiency. Or he may become a chronic complainer and troublemaker, 
thus adversely affecting the morale of his co-workers. 

Intelligence tests may also serve to reduce job turnover. When employees 
are discharged because of inability to perform a job or when they leave 
voluntarily for a variety of reasons, the resulting disruption of operations and 
the hiring and training of replacements are often costly. Several investigations 
have shown a significant relation between intelligence test scores and job 
turnover. This relation is frequently nonlinear, indicating the greatest job 
stability for persons within an optimal intellectual range. In an early study 
on groups of cashiers and inspector-wrappers, average length of service in 
days was at its peak (142 days) for persons scoring between 40 and 49 points 
on an intelligence test (96). As scores rose, length of service declined steadily 
to a low of 35 days for those scoring 90 or above. Length of service likewise 
fell off as intelligence test scores decreased, reaching a mean of only 3 days 
for those scoring between 10 and 19. 

In another study (12), clerical jobs were classified into levels, from the 
easiest (A) to the hardest (E). As can be seen in Table 2, among employees 
scoring above 110 on an intelligence test, turnover rate was greater the easier 
the job, this rate reaching 100 per cent for the two lowest levels. Those 
scoring below 80, on the other hand, showed high turnover when put in the 
harder jobs and minimal turnover at the easiest job level. Obviously proper 
job placement on the basis of intelligence test scores would have markedly 
reduced turnover in this company. 


TABLE 2 Relationship of Labor Turnover 
to Intelligence Test Score and Job Level 
(Data from Bills, 12, p. 155) 


                 Percentage Turnover among 
                 Employees Scoring 
Level of Work    Above 110    Below 80 
A (easiest)      100          37 
B                100          62 
C                 72          50 
D                 53          58 
E (hardest)       41          66 
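
The placement argument can be made concrete by looking up, for each score group, the job level with the lowest turnover. The sketch below merely transcribes the percentages from Table 2; the dictionary layout and the function name are assumptions introduced for illustration.

```python
# Percentage turnover by job level, transcribed from Table 2 (Bills, 12).
turnover = {
    "above_110": {"A": 100, "B": 100, "C": 72, "D": 53, "E": 41},
    "below_80":  {"A": 37,  "B": 62,  "C": 50, "D": 58, "E": 66},
}

def best_level(group):
    """Job level showing the lowest turnover for the given score group."""
    levels = turnover[group]
    return min(levels, key=levels.get)

for group in turnover:
    level = best_level(group)
    print(group, level, turnover[group][level])
# above_110 E 41
# below_80 A 37
```

On these figures the high scorers were most stable in the hardest jobs and the low scorers in the easiest, which is the pattern the text describes.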


When promotional opportunities are available within a company, intelligence 
tests are often used to help determine an individual's promotion potential. 
This is especially important when employees are hired at one job level 
with the definite expectation that they will be upgraded to a higher level 
within a specified interval. Some data relevant to this application of 
intelligence tests are provided by a follow-up study of 120 clerical workers (68). 
Table 3 shows the relation between the intelligence test scores of these 
employees at the time they were hired and the job level they subsequently 
attained within a seven-year period. It is evident that persons with high 
initial scores tended to be promoted to the higher-level jobs, while those with 
lower scores tended to remain in lower-level jobs. For example, of those 
scoring above 100, only 15 per cent remained in a low-level job, 30 per cent were 
in middle-level jobs, and 55 per cent in high-level jobs. In contrast, of those 
scoring below 81, 64 per cent remained in low-level jobs, 24 per cent were 
in middle-level jobs, and 12 per cent in high-level jobs. 


TABLE 3 Relationship between Initial Intelligence Test Score 
of Clerical Workers and Job Level Subsequently Attained 
(Data from Pond and Bills, 68, p. 46) 


                 Percentage of Employees in 
Intelligence     Low      Middle    High 
Test Score       Job      Job       Job 
Above 100        15       30        55 
81-100           40       48        12 
Below 81         64       24        12 

Another example is provided by an investigation of supervisors (93, p. 
118). Within one company, 70 operators were promoted to supervisory jobs. 
On the first day of their supervisory training classes, they were given an 
intelligence test, and the scores were filed for future research use. Six months 
later, about one-fourth of the 70 supervisors were no longer on their jobs, 
having left or been demoted, dismissed, or transferred. Figure 6 shows a rather 
striking relation between intelligence test score and the proportion remaining 
in their supervisory jobs. 

Job Placement of Mental Defectives. It has long been recognized that 
many industrial jobs can be filled by persons of subnormal intelligence. In 
several investigations, individuals of known mental age¹ have been placed on 
jobs on a trial basis and then followed up long enough to ascertain whether 
or not they could carry out their duties satisfactorily (7, 8, 19). Through such 
empirical procedures, lists of jobs have been drawn up for which persons of 
each mental age level from 5 to 14 could meet required performance stand- 
ards. A mental age of 5, for instance, proved adequate for such jobs as sand- 
papering furniture, washing dishes, or making nets; a mental age of 6, for 
mowing lawns, handling freight, or mixing cement; and a mental age of 7 
for rough painting, simple shoe repairing, or simple carpentry. More highly 
skilled crafts, as well as the operation of machinery, required somewhat higher 
mental ages. 

It is a well-established practice, of course, for institutionalized mental de- 
fectives to perform jobs such as these under supervision in the institution it- 
self. Since World War II, however, rehabilitation programs have been 
actively under way to enable high-grade mental defectives to return to the 
community. Several factors have contributed to this development. First, labor 
shortages make it desirable to tap one more potential source of manpower 
in the trainable mental defective. Second, rehabilitation and discharge 
provide one way to help "depopulate" overcrowded institutions for mental 
defectives. The greatest impetus to such rehabilitation programs undoubtedly came 
from legislation enacted in 1943 that made it possible to include mental 
defectives in vocational rehabilitation programs for the handicapped and made 
Federal funds available to the states for this purpose. 

¹ Such mental ages are generally determined through the administration of individual 
intelligence tests, such as the Stanford-Binet or Wechsler scales (see 3, chs. 8 and 12). 

Fig. 6 Relationship between intelligence test score and supervisory job 
tenure. (From Tiffin and McCormick, 93, p. 119.) 

Since that time, several states have initiated vocational rehabilitation pro- 
grams for institutionalized mental defectives. The success of such programs 
depends upon several conditions, including selection of promising cases, in- 
stitutional training in self-care and in vocational skills, proper job placement, 
and availability of counseling facilities during the adjustment period as well 
as after discharge when needed (28). Follow-up investigations covering 
intervals of several years have yielded consistently encouraging results (20, 45, 52). 
Although some mental defectives must be returned to institutions and a few 
get into difficulties, the majority make satisfactory social and vocational 
adjustments. 


At the opposite end of the intelligence 
and engineers have stimulated intensive 
tary services, or the contributions of individual eminent scientists (86-88). 
Some attention, however, has also been given to creative productivity in the 
arts (e.g., 24, 39). Research has focused on three major aspects of the 
problem: (1) formulation of suitable criterion measures for creative behavior 
(..., 17, 87); (2) investigation of conditions conducive to the development and 
manifestation of creative talent (70-72, 86-88); and (3) the development 
of predictors of creative achievement (40, 86-88). 

Studies of the characteristics of gifted persons, with which predictor 
development logically begins, have stressed the importance of personality as well 
as intellectual factors. Examples of the former include: liking for 
manipulation of ideas or "toying with ideas" for its own sake, need for independence 
and autonomy, intellectual persistence, and a strong drive for the activity of 
intellectual ordering (86). With regard to intellectual qualities, it is generally 
recognized that creative talent is not synonymous with academic intelligence 
as measured by traditional intelligence tests. Several investigations have been 
concerned with identifying the abilities that constitute creative talent, as well 
as developing new kinds of tests for measuring these abilities. The most com- 
prehensive of these studies is that of Guilford and his associates, who set out 
to explore four areas of thinking, namely, reasoning, creativity, planning, and 
evaluation. For the present purpose, interest centers on the various fluency, 
flexibility, and originality factors found to be most closely associated with 
creativity (101). These factors come under the heading of "divergent 
thinking," which Guilford describes as "the kind that goes off in different 
directions" (40, p. 381). Such thinking permits changes of approach in problem 
solving and leads to a diversity of answers. In contrast, "convergent thinking" 
leads to a single right answer determined by the given facts. 

The divergent-thinking factors identified in Guilford's studies can be illus- 
trated by examining some of the tests found to be heavily loaded with each 
factor (40, pp. 381-390). Some of these tests have been published; others will 
become available in the near future. Until more data are gathered with these 
tests, however, all should be regarded as research instruments only. A test of 
word fluency requires the subject to write words containing a given letter. In 
this, as in all fluency tests, the score is simply the total number of acceptable 
responses written in the time allowed. In one ideational fluency test, the task 
is to name things that belong in a certain class, such as fluids that will burn. 
In another, the subject lists different uses for a common object, such as a 
brick or pencil. Associational fluency can be illustrated by a test calling for 
all words similar in meaning to a given word, such as “hard.” Words for this 
test were chosen because each has a variety of meanings. Another test 
requires the insertion of an adjective to complete each simile (e.g., "As ______ 
as a fish"). Expressional fluency can be measured by a test of four-word 
combinations, in which the subject writes four connected words, the first 
letters of which are given. For example, if given "Y____ c____ t____ d____," 
the subject could write "You can throw dice." The subject continues to write 
different sentences for each set until time is called. 
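
The scoring rule for fluency tests, a simple count of acceptable responses, can be sketched for the four-word combinations task. The sketch is a simplification under stated assumptions: a response is treated as acceptable if it has the right number of words with the right first letters, whereas an actual scorer would also judge whether the words form a connected sentence. The function names and the sample responses other than "You can throw dice" are hypothetical.

```python
def fits_initials(response, initials):
    """True if the response has one word per given initial, in order."""
    words = response.replace(".", "").split()
    return (len(words) == len(initials)
            and all(w[0].lower() == i.lower() for w, i in zip(words, initials)))

def fluency_score(responses, initials):
    """Number of different acceptable responses written by the subject."""
    accepted = set()
    for response in responses:
        key = response.lower().strip(" .")   # ignore case and final period
        if fits_initials(response, initials):
            accepted.add(key)                # repeats are counted only once
    return len(accepted)

# Initial letters taken from the sample answer "You can throw dice."
answers = [
    "You can throw dice.",
    "Yesterday cats tore drapes",   # hypothetical response
    "You can throw dice",           # repeat of the first response
    "You may go",                   # wrong length and letters
]
print(fluency_score(answers, "yctd"))   # 2
```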

Among the tests having high loadings in flexibility factors may be 
mentioned Hidden Figures and Match Problems. In the first of these tests, 
illustrated in Figure 7, the subject must identify a simple geometric figure 
embedded in a more complex figure. The Match Problems test requires the 
removal of a specified number of matchsticks to leave a given number of squares 
or triangles. An example from this test is reproduced in Figure 8. In all these 
tests, a good performance requires freedom from persistence of approaches, 
permitting a restructuring of the given stimuli. 

Fig. 7 Sample items from the Hidden Figures test. Which of the five figures 
at the top is concealed in each of the item figures? Answers: 1, A; 2, D. (From 
Guilford, 40, p. 386.) 

Originality can be measured by an adaptation of the familiar free 
association test, in which the subject must respond rapidly to each stimulus word 
by giving the first word that occurs to him. In scoring this test, each response 
is weighted in inverse proportion to its commonness in the general popula- 
tion. There is some evidence that such groups as scientists, engineers, artists, 
musicians, and writers tend to give less common associations than executives, 
salesmen, teachers, and politicians. Another example is the Consequences 
test, which provides separate scores in ideational fluency and originality. In 
this test, the subject is told to list as many different consequences of some 
hypothetical event as he can. For example, "What would be the results if 
people no longer needed or wanted sleep?" Responses are classified as 
obvious or remote according to rules given in the manual. The number of obvious 
responses provides the ideational fluency score; the number of remote 
responses, the originality score. 

Fig. 8 Sample item from the Match Problems test. If each line is a match, 
can you take away four matches from A, leaving three squares and nothing 
more? If the subject works under the assumption that all squares must be of 
the same size, he will be unable to reach the correct solution, shown in B. 
(From Guilford, 40, p. 387.) 
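
The two scoring schemes for originality described above can be sketched as follows. The text states only that each association is weighted in inverse proportion to its commonness; taking the weight as the reciprocal of the proportion of the general population giving that response is an assumption made here for illustration, and the commonness figures are invented.

```python
def originality_score(responses, commonness):
    """Sum of weights, each the reciprocal of a response's commonness.

    `commonness` maps a response to the proportion of the general
    population giving it (hypothetical figures in this sketch).
    """
    return sum(1.0 / commonness[r] for r in responses)

def consequences_scores(labels):
    """Return (ideational fluency, originality) for the Consequences test.

    `labels` holds the classification of each response, "obvious" or
    "remote", as assigned by a scorer following the manual.
    """
    return labels.count("obvious"), labels.count("remote")

# Hypothetical commonness of two associations to one stimulus word.
commonness = {"chair": 0.5, "cloth": 0.25}
print(originality_score(["chair", "cloth"], commonness))      # 6.0

print(consequences_scores(["obvious", "remote", "obvious"]))  # (2, 1)
```

The rarer association contributes twice as much to the total as the commoner one, which is the sense of inverse weighting.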


and expanded definition of intelligence. In contrast to earlier definitions, this 
formulation gives more weight to nonverbal and to creative processes. In the 
light of such developments, intelligence tests of the future will probably in- 
clude more spatial and other nonverbal content and will tap more reasoning 
and creative functions, in addition to the traditional functions of comprehension 


SPECIAL APTITUDE TESTS 


In the selection of personnel for most jobs, intelligence tests must be 
supplemented with tests of special aptitudes. Frequently, such aptitude tests 
are custom-made to fit the demands of a particular job. For certain aptitudes 
common to many jobs, however, published tests are available. Most of these 
standardized tests for general use are in the areas of vision, hearing, simple 
motor skills, mechanical aptitude, and clerical aptitude. 

Many industrial jobs call for minimum standards in vision or hearing. Con- 
siderable research is available showing the effects of sensory capacities upon 
quantity and quality of output, spoilage and waste of materials, job turn- 
over, and accidents (93, chs. 5 and 14). Many types of military specialties 
likewise make heavy demands upon visual or auditory capacities. 

Both visual and auditory sensitivity include not one but many functions. 
Among the visual characteristics found to be of greatest practical importance 
may be mentioned: near acuity at "reading distance" (13 to 16 inches), far 
acuity (usually measured at 20 feet), perception of distance or depth, 
muscular balance of the eyes (phoria), and color discrimination. Composite 
instruments have been developed for measuring all these aspects of vision. An 
example is the Ortho-Rater, illustrated in Figure 9. Providing measures of 
each of the above characteristics, the Ortho-Rater is widely used in industry. 

In hearing, the aspect of most general concern is auditory acuity, measured 
by finding the faintest sound that the individual can just barely hear. Both 
individual and group audiometers are available for measuring such acuity 
(47). Other hearing characteristics important for certain industrial and mili- 
tary functions include discrimination of pitch and of loudness of different 
sounds. 

Tests of motor skills are used widely in the selection of industrial and mili- 
tary personnel. Usually, however, such tests are tailored to meet specific job 
requirements. The United States Air Force, for instance, developed many 
motor coordination tests for its classification program (31, 57). An example 
is the Complex Coordination Test, illustrated in Figure 10. In this test, the 
examinee operates three controls similar to the rudder and stick controls used 
in an airplane. The stimuli consist of different patterns of signal lights on the 
vertical panel. 

Fig. 9 Ortho-Rater for testing visual functions. (Courtesy of Bausch & 
Lomb Optical Company.) 

Most commercially available tests of motor skills call for very simple types 
of movement. Often they require the manipulation of small objects, as in the 
Purdue Pegboard (90), illustrated in Figure 11. In one part of this test, the 
subject inserts a pin in each hole, using the right hand, the left hand, and 
both hands simultaneously in successive trials. In another part of the test, 
pins, collars, and washers are assembled and fitted into each hole, the 
subject employing both hands in a prescribed procedure. Tests such as these are 
utilized in selecting workers for assembling, packing, routine machine 
operations, and similar jobs. Their validity should be checked for each job, 
however, since motor skills have proved to be highly specific (see 3, pp. 383-386). 
Even for the same job, moreover, different tests may be required to predict 
quantity and quality of output, as illustrated in a study of 
power-sewing-machine operators (64). 

Tests commonly classified under the heading of mechanical aptitude cover 
a variety of functions. Motor skills enter into some of these tests, either 
because the test may call for rapid manipulation of materials or because special 
subtests designed to measure motor dexterity are included. Perceptual speed 
and accuracy also play a part in certain tests. The major factors measured by 
mechanical aptitude tests, however, are spatial visualization and mechanical 
comprehension. 

Fig. 10 Complex Coordination Test used in Air Force classification 
program during World War II. (Courtesy of U.S. Air Force; for 
description of test, see Melton, 57.) 


A test designed to measure the first of these abilities is the Minnesota 
Paper Form Board. This is one of a series of tests originally developed in an 
extensive investigation of mechanical aptitude conducted at the University 
of Minnesota (65). Its current revision (56) is illustrated in Figure 12. Each 
item consists of a geometric figure cut into two or more parts. The subject 
must visualize how the pieces fit together and choose the drawing that shows 
the correct arrangement. A large number of studies conducted with this test 
indicate that it is one of the most valid available instruments for measuring 
the ability to visualize and manipulate objects in space (56). Among the 
criteria employed in this research were performance in shop courses, grades 
in engineering and in other technical and mechanical courses, supervisors' 
ratings, and objective production records. The test has also shown some 
validity in predicting the achievement of dentistry and art students. 

Fig. 11 Purdue Pegboard. (Courtesy of Science Research Associates.) 

Fig. 12 Sample items from the Revised Minnesota Paper Form Board. 
For each item, the subject must choose the figure which would 
result if the pieces in the first section were assembled. 
(Reproduced by permission of The Psychological Corporation. Copyright 
1941, Rensis Likert and W. H. Quasha.) 

Fig. 13 Sample items from the Test of Mechanical Comprehension, 
Form AA: "Which room has more of an echo?" and "Which would be the 
better shears for cutting metal?" (Reproduced by permission. Copyright 
1940, The Psychological Corporation.) 

Another test that has been widely employed for both civilian and military 
purposes is the Bennett Test of Mechanical Comprehension (9), illustrated in 
Figure 13. Several published studies of this test provide evidence of both 
concurrent and predictive validity for mechanical trades and engineering. 
Correlations between .30 and .60 have been found with either training 
or job-proficiency criteria in many kinds of mechanical jobs (35, 66). During 
World War II, this test proved to be one of the best predictors of pilot 
success (42). Its validity for this criterion seems to have resulted chiefly from 
the contribution of a mechanical information factor and a spatial visualization 
factor, which together accounted for about 60 per cent of the variance of its 
scores. 

Tests designed to measure clerical aptitude are primarily concerned with 
perceptual speed and accuracy. A well-known example is the Minnesota 
Clerical Test (4), illustrated in Figure 14. In the two parts of this test, the 
subject compares pairs of numbers and names, respectively, to determine 
whether they are the same or different. Although a deduction is made for 
errors, the scores depend predominantly upon speed. Moderately high correla- 
tions have been reported between scores on this test and ratings by office 
supervisors or by commercial teachers, as well as performance records in 
courses and in various kinds of clerical jobs (4, 23). Several studies compared 
the scores obtained by different levels of clerks, by clerks and by persons in 
other occupations, and by employed and unemployed clerks. All these 
comparisons yielded significant mean differences in the expected directions. 

Fig. 14 Sample items from the Minnesota Clerical Test. (Reproduced 
by permission. Copyright 1933, The Psychological Corporation.) 

When the two numbers or names in a pair are exactly the same, 
make a check mark on the line between them. 

66273894 ______ 66273984 
527384578 ______ 527384578 
New York World ______ New York World 
Cargill Grain Co. ______ Cargil Grain Co. 

It is apparent that such a relatively homogeneous instrument as the 
Minnesota Clerical Test measures only one aspect of clerical work. Clerical 
jobs cover a multiplicity of functions. Moreover, the number and particular 
combination of duties vary tremendously with the type and level of job. Even 
specific jobs designated by the same name, such as typist, filing clerk, or ship- 
ping clerk, may differ considerably from one company to another. Despite this 
diversity of activities, however, job analyses of general clerical work indicate 
that a relatively large proportion of time is spent in such tasks as classifying, 
sorting, checking, collating and stapling, stuffing and sealing envelopes, and 
the like (10). Speed and accuracy in perceiving details are thus of primary 
importance for the clerical worker, especially in the lower-level, routine clerical 
jobs (44). For higher-level clerical jobs, the verbal and numerical abilities 
measured by intelligence tests are undoubtedly more relevant. Several clerical 
aptitude tests combine verbal and numerical tests with tests of perceptual 
speed and accuracy in a single battery (see 3, pp. 395-396). Such composite 
tests are useful provided they yield separate scores for each part, since the 
relative weights of these abilities may vary widely from one clerical job to 
another (see, e.g., 77). 


MULTIPLE APTITUDE BATTERIES 


Special aptitude tests in such fields as the mechanical and the clerical were 
originally developed to supplement general intelligence tests. Particularly in 
vocational testing, it soon became evident that intelligence tests did not cover 
all requisite intellectual functions. Concurrently, analyses of intelligence test 
performance revealed conspicuous intraindividual differences from one 
intelligence test to another or among different parts of the same test. A person might 
do well on a written test but poorly on a performance test requiring spatial 
orientation; or, in the traditional written test, he might be consistently better 
on the numerical than on the verbal items. It thus appeared that the intelli- 
gence tests themselves might be measuring a small number of relatively inde- 
pendent aptitudes in which the same individual could vary appreciably. 

A parallel development in basic research was the study of the nature and 
organization of intelligence by the techniques of factor analysis (see 3, pp. 
339-348). Essentially such techniques involve the administration of a large 
number of tests to the same persons. The scores on all the tests are then 
intercorrelated and the correlations subjected to further statistical analyses in 
order to determine which tests tend to cluster together and which are relatively 
independent. Examples of the separate abilities, or "factors," thus identified 
include verbal comprehension, numerical computation, quantitative reasoning, 
perceptual speed, spatial visualization, and mechanical comprehension. 
Through factor analysis, what had formerly been called intelligence could now 
be subdivided into relatively independent abilities, and these abilities could 
be combined with those underlying the special aptitude tests to provide a more 
comprehensive picture of intelligence. 
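
The first step of the procedure just described, intercorrelating the tests and looking for clusters, can be illustrated with a small sketch. It is not a factor analysis proper: the grouping rule (tests correlating at or above an arbitrary threshold are placed together) is a simplified stand-in for factor extraction, and the test names, scores, and the threshold of .50 are all invented for illustration.

```python
from itertools import combinations
from math import sqrt

# Hypothetical scores of the same eight persons on four tests (invented data).
SCORES = {
    "vocabulary":    [10, 12, 14, 16, 18, 20, 22, 24],
    "reading":       [11, 12, 15, 15, 19, 19, 23, 23],
    "arithmetic":    [20, 10, 24, 12, 22, 14, 18, 16],
    "number_series": [19, 11, 23, 13, 21, 15, 19, 15],
}

def pearson(x, y):
    """Product-moment correlation between two equally long score lists."""
    n = len(x)
    mean_x, mean_y = sum(x) / n, sum(y) / n
    sxy = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    sxx = sum((a - mean_x) ** 2 for a in x)
    syy = sum((b - mean_y) ** 2 for b in y)
    return sxy / sqrt(sxx * syy)

def cluster_tests(scores, threshold=0.5):
    """Group tests whose intercorrelation reaches the (assumed) threshold."""
    names = list(scores)
    cluster_of = {name: {name} for name in names}
    for a, b in combinations(names, 2):
        if pearson(scores[a], scores[b]) >= threshold:
            merged = cluster_of[a] | cluster_of[b]
            for name in merged:
                cluster_of[name] = merged
    clusters = []
    for name in names:
        members = sorted(cluster_of[name])
        if members not in clusters:
            clusters.append(members)
    return clusters

print(cluster_tests(SCORES))
```

With these invented scores the two verbal tests fall into one cluster and the two numerical tests into another, while the correlations across clusters stay close to zero. An actual factor analysis would go further and estimate how strongly each test reflects each underlying factor.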

The construction of multiple aptitude batteries was the direct outcome of 
factorial research. Rather than yielding a single global score, such as an IQ, 
these batteries provide a profile of scores on separate tests, most of which 
correspond more or less closely to traits identified through factor analysis. 
Several batteries of this sort are now commercially available (see 3, ch. 13; 84). 
An example is the Differential Aptitude Tests (DAT), for which the largest 
amount of validity information is available (11). The DAT yields scores in 
eight abilities: Verbal Reasoning, Numerical Ability, Abstract Reasoning, 
Clerical Speed and Accuracy, Mechanical Reasoning, Space Relations, Spelling, 
and Grammar. Sample items from each of these tests can be found in 
Chapter 16 (Fig. 107, pp. 436-437). Figure 15 is a profile chart showing the 
DAT scores obtained by a young man whose greatest strengths are in Verbal 
Reasoning and Numerical Ability. The score in the third column, based on 
the sum of these two scores, provides a single index of scholastic aptitude 
similar to that obtained from most intelligence tests. 

Another multiple aptitude battery is the more recently published Flanagan 
Aptitude Classification Tests, or FACT (30). Oriented principally toward 
vocational counseling and employee selection, this battery comprises 21 tests, 
each designed to measure a "critical job element" identified through job 
analyses. Examples of these tests include: Assembly (ability to visualize the 
appearance of an object assembled from a number of separate parts); Planning
(ability to plan, organize, and schedule; ability to foresee problems that
may arise and to anticipate the best order for carrying out the various steps); 
Ingenuity (creative or inventive skill; ability to devise ingenious procedures, 
equipment, or presentations). Sample items from the Assembly and Ingenuity 
tests are illustrated in Figure 16. 

On the basis of the initial job analyses, scores from the FACT battery were 
combined into "occupational aptitude scores" for over three dozen jobs ranging 
from humanities teacher to plumber. The predictive validity of these occupational
scores is being investigated by following up the educational and
vocational careers of high school students tested in the standardization sample.
Preliminary findings suggest moderately promising validity, but a definitive
evaluation of this battery must await the accumulation of many more research
data.

Fig. 15 Profile of scores on the Differential Aptitude Tests. Vertical
bars show distance above or below norm on each test. Percentiles are
spaced so as to correspond to equal distances in a normal distribution.
(Reproduced by permission. Copyright 1963, The Psychological Corporation.)

Mention should also be made of the classification batteries developed for 
use in the military services and of the General Aptitude Test Battery (GATB)
prepared by the United States Employment Service (26). The latter consists
of 12 tests yielding the following nine scores: Intelligence, Verbal Aptitude,
Numerical Aptitude, Spatial Aptitude, Form Perception, Clerical Perception,
Motor Coordination, Finger Dexterity, and Manual Dexterity. The Intelligence
Score is a factorially complex but practically useful measure based on performance
in vocabulary, arithmetic reasoning, and spatial visualization tests.
The other scores correspond more nearly to independent abilities identified
through factor analysis.


Assembly: In each figure, parts are to be assembled so that places having the same letter are
put together. Which of the five assemblies below shows the parts put together correctly?

Ingenuity: Think of a word or words that will complete a clever and ingenious solution to
the stated problem. Then choose the alternative having the same first and last letters and
the same number of letter spaces as your answer.

51. A hostess for a children's party wanted to serve
ice cream in an interesting manner, and she decided
to make a clown for each child. She placed a ball of
ice cream to represent the clown's head on a round
cookie which served for a collar, and on top of this
she inverted a

[Drawings and answer alternatives not reproduced.]

Answers: Assembly, C; Ingenuity, D (cone)

Fig. 16 Sample items from Flanagan Aptitude Classification Tests.
(Reproduced by permission of John C. Flanagan.)

So far, multiple aptitude batteries have been used more extensively for 
vocational counseling of individuals than for personnel selection or placement. 
With the growing emphasis on classification programs, however, such batteries 
may come to play a more prominent part in industrial personnel programs. 
When selecting applicants for a single job, most personnel psychologists rely 
on a general intelligence test plus very specific aptitude tests chosen to fit the 
particular job. But when the problem is that of maximum utilization of an 
available manpower pool, a multiple aptitude battery measuring a few broad 
abilities that cut across many different jobs is more effective. A major weakness 
of current multiple aptitude batteries, however, is their disappointing per- 
formance with regard to differential validity (see Chapter 2). It is possible 
that with an increasing utilization of vocational proficiency criteria, in contrast
to academic or vocational training criteria, more differentiating batteries
can be developed. Further research is undoubtedly needed to explore the 
potential contribution of this type of testing instrument. 


ACHIEVEMENT TESTS 


Achievement tests are designed to assess the proficiency level currently attained 
by an individual. An end-of-term school examination is an achievement test, 
as are the many standardized tests employed in educational settings, from the 
elementary grades to graduate and professional schools. When used in in- 
dustrial situations, achievement tests are often called trade tests. It is customary 
to contrast achievement tests with aptitude tests, the latter including intelligence
tests, special aptitude tests, and multiple aptitude batteries. From one
point of view, the distinction is one of degree of uniformity of assumed antecedent
experience. Achievement tests measure the effects of relatively uniform sets
of experiences, such as a course in Spanish, in speed writing, or in TV repair- 
ing. Aptitude test performance, on the other hand, reflects the cumulative 
influence of a multiplicity of experiences in daily living, including nature and 
extent of general education, home environment, and a myriad other influences. 
We might say that aptitude tests measure the effects of learning under relatively 
uncontrolled and unknown conditions, while achievement tests measure the 
effects of learning under partially known and controlled conditions. 

A second distinction between aptitude and achievement tests pertains to 
their uses. Aptitude tests serve to predict subsequent performance. They are 
employed to estimate the extent to which the individual will profit from 
training or to forecast his performance in a new situation. Achievement tests, 
on the other hand, generally provide a terminal evaluation of the individual’s 
status upon the completion of training. The emphasis in such tests is upon 
what the individual can do at the time. Neither distinction, however, can be 
rigidly applied. Some aptitude tests may presuppose fairly specific and uniform 
prior training, while some achievement tests cover relatively broad and un- 
standardized educational experiences. Similarly, any achievement test may be 
used as a predictor of future learning. As such it serves the same purpose as 
an aptitude test. 

When used in personnel selection, achievement tests play the same role as 
aptitude tests. In such situations, they should be validated against criteria of 
subsequent job proficiency, as is done for any other predictor. Examples of this 
use of achievement tests can be found in the hiring of typists, engineers, ap- 
prentices in skilled trades, or any other job for which specialized vocational 
training is a prerequisite. Achievement tests may also be used to assess the 
effectiveness of a training program, to determine when individuals have attained 
the required proficiency level in a training course, to identify present employees 
in need of special training, and to help in selecting employees for transfers or 
promotions. 

Some achievement tests are concerned with technical information, others
with job skills. Information may be tested by a set of standardized oral questions
or by written tests, the latter permitting either group or individual
administration. Examples of commercially available written tests developed for
general use are provided by the Purdue Vocational Tests,* which cover such
jobs as those of machinist, engine-lathe operator, electrician, or carpenter. 

Another technique for appraising vocational training and experience involves
the use of oral trade tests, or short series of questions about specialized
trade knowledge. The items of information are chosen so as to be fairly easy 
for anyone who has actually worked in a particular trade but rarely familiar to 
others. For this reason, oral trade questions are often used as a check on the 
reported job experience of an applicant. The United States Employment 
Service has developed standardized sets of oral trade questions for well over 
a hundred jobs (79, ch. 3 and pp. 156-162). Each set consists of 15 questions, 
such as the following taken from the bricklayer's test: 


Question: What do you mean by building up a lead? 
Answer: Building up a section (corner) of wall. 


For each occupation, the questions that had been formulated on the basis of 
extensive job analyses and preliminary tryouts were administered to three 
groups of workers: experts, beginners, and workers in related occupations. 
Items were chosen for the final sets in terms of the significance and magnitude 
of the differences between the percentage of workers in each of these groups 
who answered the question correctly. The differences were so large that little 
overlapping in total scores remained when the three groups were compared. 
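
The item-selection step can be sketched as follows. The sketch retains a question only when the percentage of experts answering it correctly exceeds the percentage in each comparison group by a fixed margin. The margin of 40 percentage points, the question labels, and the response data are assumptions made for illustration; the procedure described above also took the statistical significance of the differences into account, which is omitted here.

```python
# Hypothetical tryout data: 1 = correct answer, 0 = incorrect (invented).
TRYOUT = {
    "q1": {"experts":   [1, 1, 1, 1, 0],
           "beginners": [1, 0, 0, 0, 0],
           "related":   [0, 1, 0, 0, 0]},
    "q2": {"experts":   [1, 1, 1, 1, 1],
           "beginners": [1, 1, 1, 1, 0],
           "related":   [1, 1, 1, 0, 1]},
    "q3": {"experts":   [1, 1, 1, 1, 1],
           "beginners": [0, 0, 1, 0, 0],
           "related":   [0, 0, 0, 0, 0]},
}

def percent_correct(responses):
    """Percentage of a group answering the question correctly."""
    return 100.0 * sum(responses) / len(responses)

def select_items(tryout, min_gap=40.0):
    """Keep questions on which experts exceed both other groups by min_gap points."""
    selected = []
    for question, groups in tryout.items():
        pct = {group: percent_correct(r) for group, r in groups.items()}
        gap = pct["experts"] - max(pct["beginners"], pct["related"])
        if gap >= min_gap:
            selected.append(question)
    return selected

print(select_items(TRYOUT))
```

In this invented example the second question is dropped: nearly everyone answers it correctly, so it cannot separate experienced workers from the other two groups.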

In the large majority of jobs, information is a necessary but not a sufficient 
condition for satisfactory performance. Information tests thus need to be supple- 
mented by tests of job skills. These skills are usually measured through 
standardized work samples which closely reproduce actual job processes. Tests 
administered before the granting of a driver's license or a pilot’s license 
illustrate the work sample technique. Another example is the Miniature Punch 
Press Test, reproduced in Figure 17. More complex work-sample tests have 
been developed for the evaluation of technical military personnel (see, e.g., 
13). Among the best-known work-sample tests available for general use are 
those designed for clerical jobs, especially typewriting, stenography, and bookkeeping
(e.g., 74, 95). Stenographic tests usually employ sound recordings
to ensure uniformity in speed and clarity of dictation. 

Of particular interest is a recent attempt to devise a work-sample test for 
certain aspects of executive work. Known as the In-Basket Test, this technique 
has been adapted for testing Air Force officers in administrative positions (34), 
business executives (49), and school principals (33, 46). Simulating the 
familiar "in-basket" found on the administrator's desk, this test provides a carefully
prepared set of incoming letters, memoranda, reports, papers to be signed,
and similar items. Before taking the test, the subject has an opportunity to 
study background materials for orientation and information regarding the 
hypothetical job. During the test proper, his task is to handle all the matters 
in his in-basket as he would on the job. All actions must be recorded in 
writing but may include letters, memos, decisions, plans, directives, information
to be obtained or transmitted by telephone, agenda for meetings, or any
other notes. 

* Distributed partly by Science Research Associates and partly by University Book
Store, West Lafayette, Ind.

Fig. 17 Miniature Punch Press Test. (From Tiffin and Greenly,
91, p. 451.)


PERSONALITY TESTS 


It is now widely recognized that many job failures result from personality
deficiencies. Having the required intellectual level, knowledge, and skills does
not ensure that the individual will be an effective producer, a satisfied
employee, or a well-liked co-worker. Personality characteristics may determine
success or failure on any kind of job. Nevertheless, it is obvious that they
play a particularly important part in jobs requiring extensive interpersonal
contacts, such as selling and supervisory work. Understandably enough, the
possible contributions of available personality tests to personnel selection have 
been somewhat more fully explored in these fields than elsewhere. 

Several attempts have also been made to develop special tests for salesmen 
(15, 16, 53) and for managerial personnel (29, 32, 59, 73). The construction 
of objective testing instruments for such traits as empathy, social sensitivity, 
or supervisory attitudes presents many difficulties. Frequently the resulting 
tests measure mostly intelligence or familiarity with acceptable human- 
relations practices and may show little relation to criteria of actual interpersonal 
behavior. The earlier instruments have proved disappointing; the later ones, 
although more promising, are still largely untried. But the continuing research 
program in this area, coupled with improved techniques of test construction, 
may yield fruitful results before too long. 

Of considerable relevance, too, is the extensive research concerned with 
executive job analyses and criteria of executive success (see 36, 75, 78, 89). 
Much has been learned about the relative amount of time executives spend 
in different duties, the traits considered important in an executive, causes of 
executive failure and success, differences between executives at various levels 
or between industrial and military executives, and similar questions. It has been 
found, for instance, that all executives spend a large proportion of their time 
in interpersonal contacts and that public relations, or “representative,” functions 
assume increasing importance at the higher executive levels. Similarly, inter- 
personal traits are frequently cited to account for the success or failure of 
individual executives. 

The increasing use of personality tests in executive selection has certainly 
not been without its critics, as exemplified by the vituperative treatment given 
this topic by Whyte in The Organization Man (100). A thorough and lucid 
analysis of Whyte's criticisms can be found in a paper by Stark (78). For the 
present purpose, it will suffice to note two points about Whyte's discussion. 
First, many of Whyte's criticisms are directed against misuses of tests, such as 
excessive reliance on personality test scores in the absence of other information 
or the administration of inadequately validated instruments. These are ob- 
jectionable practices that reputable psychologists themselves have vigorously 
condemned. 

In the second place, Whyte argues that personality tests are generally used 
in industry to select executives who are conformists and lacking in individuality
(i.e., typical "organization men"). Insofar as this criticism may be
true, it is an indictment, not of the tests, but of the criteria of executive success 
against which they must be validated. It would apply just as much—if not 
more so—to the relatively subjective and unstandardized predictors advocated 
by Whyte, such as previous job history and interviewing techniques. The 
underlying social philosophy that Whyte so eloquently attacks is a matter for
debate in its own right. But it has no intrinsic relation to psychological tests
or to their use in personnel selection. 

In the rest of this section, we shall consider the major types of personality 
tests used in personnel selection in general. These include measures of in- 
terests, personality inventories, projective techniques, and situational tests. 
Each type of test will be illustrated with two or three examples. 

Measurement of Interests. An early instrument that has stood the test 
of time unusually well is the Vocational Interest Blank (VIB). First developed
by Strong in 1928, this test has undergone continuing revision, extension, and
research, including long-term longitudinal validation (80-83). In this test,
the individual records his liking or dislike for a wide variety of activities, 
objects, or types of people commonly encountered in daily living. He is not 
required to judge in advance his interest in occupations about which he may 
have inadequate knowledge or which he may perceive in terms of misleading 
popular stereotypes. Preliminary research demonstrated that persons engaged 
in a given occupation tend to have common interests, not only in job activities, 
but also in school subjects, hobbies, sports, types of plays or books, social 
relations, and the like. It thus proved feasible to question the individual about 
his interests in familiar things and thereby determine how closely his interests 
resemble those of persons successfully engaged in each occupation. 

The Strong VIB is scored with a different key for each occupation. The 
items to be included in each key, as well as the weights assigned to each 
response, were found empirically by comparing the percentage of persons in 
the particular occupation giving each answer with the percentage of “men 
in general” who gave it. Responses occurring significantly more often, let us 
say, among lawyers than among men in general received positive weights in 
the lawyer's key; those occurring significantly less often among lawyers re- 
ceived negative weights. By such empirical criterion keying, about fifty occupa- 
tional keys have been developed for the men's form and about thirty for the 
women's form. New keys are constructed as the need and opportunity arise. 
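
Empirical criterion keying of this kind lends itself to a brief sketch. In the version below, a response enters the occupational key with a positive weight if it is given more often by members of the occupation than by the general reference group, and with a negative weight if it is given less often. Several simplifications are assumptions of the sketch and not features of the Strong VIB itself: unit weights of +1 and -1, a fixed cutoff of 15 percentage points in place of a significance test, and invented items and percentages.

```python
# Hypothetical percentages giving each response (invented for illustration).
LAWYERS = {
    ("public speaking", "like"): 70.0,
    ("public speaking", "dislike"): 15.0,
    ("repairing a clock", "like"): 20.0,
    ("repairing a clock", "dislike"): 60.0,
    ("reading newspapers", "like"): 80.0,
}
MEN_IN_GENERAL = {
    ("public speaking", "like"): 50.0,
    ("public speaking", "dislike"): 30.0,
    ("repairing a clock", "like"): 45.0,
    ("repairing a clock", "dislike"): 35.0,
    ("reading newspapers", "like"): 78.0,
}

def build_key(occupation_pct, general_pct, min_diff=15.0):
    """Assign +1 or -1 to responses that differentiate the occupation."""
    key = {}
    for response, occ_pct in occupation_pct.items():
        diff = occ_pct - general_pct[response]
        if diff >= min_diff:
            key[response] = 1
        elif diff <= -min_diff:
            key[response] = -1
    return key

def score(answers, key):
    """Sum the key weights of the responses a person actually gave."""
    return sum(key.get(answer, 0) for answer in answers)

lawyer_key = build_key(LAWYERS, MEN_IN_GENERAL)
applicant = [("public speaking", "like"),
             ("repairing a clock", "dislike"),
             ("reading newspapers", "like")]
print(score(applicant, lawyer_key))
```

Note that the newspaper item receives no weight in this key: it is endorsed about equally often by both groups and therefore tells nothing about resemblance to the occupation, however relevant it may appear on its face.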

Other tests, such as the Kuder Preference Record (54), report interest scores, 
not in terms of specific occupations, but in terms of broad areas, such as 
mechanical, scientific, persuasive, and artistic areas. Similarly, the Study of 
Values (1) assesses the relative strength of six basic interests, motives, or 
evaluative attitudes, designated as theoretical, economic, aesthetic, social, 
political, and religious. From a different angle, the more recently developed 
Survey of Interpersonal Values (37) determines the relative importance the 
individual places on six aspects of his relationships with others: support, 
conformity, recognition, independence, benevolence, and leadership. 

With all these tests, the score pattern desirable for any occupation needs 
to be empirically established by the usual validation procedures. As in any 
test, scores cannot be interpreted at face value on the basis of the trait names 
ascribed to them. To some extent this is also true of the Strong VIB, especially 
for certain occupations. Different types of salesmen, for instance, show different 
response patterns on this test, so that separate keys had to be developed for 
them (e.g., 48, 102). It should be added that interest tests are used more 
extensively in vocational counseling than in personnel selection and will there- 
fore be discussed further in Chapter 16. 

Personality Inventories. This type of personality test originated in the 
attempt to identify emotional maladjustment and neurotic behavior. A num- 
ber of current personality inventories—especially those designed for use in 
clinical contexts—still focus primarily on emotional abnormality. Most of the 
newer inventories, however, are concerned with individual differences in 
emotional, motivational, and social traits among essentially normal persons. 
Moreover, such inventories typically provide a profile of scores in a number of 
traits, rather than a single score. The norms on such tests generally fall near 
the center of the score range. Deviations in either direction from such a norm 
need to be interpreted in terms of particular job requirements. Desirable score
patterns for each type of job may be empirically established by the usual
follow-up procedures described in the preceding chapter. 

An example of a personality inventory is the Guilford-Zimmerman Temperament
Survey (43), which yields separate scores on the following 10 traits:
general activity, restraint, ascendance, sociability, emotional stability, objec- 
tivity, friendliness, thoughtfulness, personal relations, and masculinity. The 
items measuring these traits are presented in random order and not identified 
as such to the subject. Three typical items are: 


You start work on a new project with a great
deal of enthusiasm .................................. YES ? NO
You are often low in spirits ........................ YES ? NO
Most people use politeness to cover up what is
really "cutthroat" competition ...................... YES ? NO


A common difficulty limiting the usefulness of personality inventories in 
personnel selection is their susceptibility to faking. Many investigations have 
demonstrated that, on self-report inventories, subjects can frequently identify 
the socially desirable answers and will mark such answers instead of those 
corresponding to their own habitual behavior (see 3, pp. 59-60). The motiva- 
tion to create a favorable impression is of course stronger in a selection than 
in a counseling situation, although not totally absent even in the latter. 
Several techniques have been devised for reducing or at least detecting such 
faking. The Guilford-Zimmerman, for example, provides three verification 
keys, based on the responses to certain items, to detect falsification and carelessness
in responses. A procedure designed to reduce the tendency to fake is
known as the forced-choice technique. In a forced-choice item, the subject 
must choose between answers that appear equally acceptable (or unacceptable) 
but differ in validity for a specific criterion. 
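
How such items might be assembled can be suggested by a rough sketch. It pairs statements whose rated social desirability is nearly equal but whose validity against the criterion differs. The statements, the ratings, the validity values, and both tolerance limits are invented for illustration, and the simple greedy pairing rule is an assumption of the sketch, not the procedure of any published inventory.

```python
# Hypothetical item pool (all values invented).
# desirability: mean rating of how favorable the statement sounds
# validity: correlation of the statement with the job criterion
POOL = [
    {"text": "Finishes tasks on schedule",   "desirability": 4.2, "validity": 0.35},
    {"text": "Is pleasant to talk with",     "desirability": 4.1, "validity": 0.05},
    {"text": "Sometimes loses patience",     "desirability": 2.0, "validity": -0.25},
    {"text": "Sometimes puts off decisions", "desirability": 2.1, "validity": 0.00},
]

def forced_choice_pairs(pool, max_desirability_gap=0.3, min_validity_gap=0.2):
    """Pair statements that look equally acceptable but differ in validity."""
    pairs = []
    unused = sorted(pool, key=lambda item: item["desirability"])
    while len(unused) >= 2:
        first = unused.pop(0)
        for index, candidate in enumerate(unused):
            similar_appeal = (abs(candidate["desirability"] - first["desirability"])
                              <= max_desirability_gap)
            different_validity = (abs(candidate["validity"] - first["validity"])
                                  >= min_validity_gap)
            if similar_appeal and different_validity:
                pairs.append((first["text"], candidate["text"]))
                unused.pop(index)
                break
    return pairs

for pair in forced_choice_pairs(POOL):
    print(pair)
```

Because the two statements in each pair sound about equally favorable (or unfavorable), a subject trying to make a good impression gains little by preferring one over the other, yet only one of them is related to the criterion.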

The first commercially available inventory to use the forced-choice technique 
was the Jurgensen Classification Inventory (51). Designed specifically for in- 
dustrial application, this inventory provides no general norms or scoring key. 
Test users are expected to develop scoring keys by empirical tryout against 
local criteria. Sample items from this inventory are reproduced below, together 
with abridged instructions to subjects: 


Mark the one that you think is MOST irritating and the one that you think is
LEAST irritating.
People who are
    Bluffers .......................... M L
    Complainers ....................... M L
    Interrupters ...................... M L

Decide which reputation you would MOST prefer to have and which you would
LEAST prefer to have.
    Considered ........................ M L
    Calm .............................. M L
    Alert ............................. M L
    Friendly .......................... M L

Mark the item you prefer, using XX if your preference is strong and X if it is weak.
    Have interesting work with moderate pay ____
    Have uninteresting work with high pay ____


Another personality inventory utilizing the forced-choice technique is the 
Edwards Personal Preference Schedule (EPPS) (27). Based on the personality 
theory of Murray (61), this inventory was designed to measure the relative 
strength of 15 needs, described as follows: Achievement, Deference, Order, 
Exhibition, Autonomy, Affiliation, Intraception, Succorance, Dominance, 
Abasement, Nurturance, Change, Endurance, Heterosexuality, Aggression. In 
the inventory, items from each of these needs are paired off with items from 
the other 14. Within each pair, the two items were matched in their previously
determined social desirability. A sample profile of scores on the EPPS
is shown in Figure 18. The EPPS has not been available long enough to 
permit an adequate evaluation of its potential industrial contribution. Some 
promising results are provided by recent research with this test on the person- 
ality patterns of engineers (50) and of industrial and retail salesmen (25). It 
should be noted, however, that the forced-choice technique reduces but does 
not eliminate faking, especially by the applicant for a specific job. Even when 
items are equated for general social desirability, the individual may recognize 
that certain responses are more relevant to the particular job. 

Fig. 18 Profile on the Edwards Personal Preference Schedule.
(Reproduced by permission. Copyright 1959, The Psychological Corporation.)

Projective Techniques. It is principally in clinical psychology that projective
techniques have enjoyed their greatest popularity. Hence they will be
discussed more fully in Chapter 13. These techniques, however, are also used
to a limited extent in industry, especially for intensive individual evaluation 
of executives and other high-level personnel. Typically in a projective test the
subject is assigned an unstructured task, which permits an almost unlimited
variety of possible responses. The test stimuli are usually vague and equivocal
and the instructions brief and general. Such tests are based on the hypothesis 
that the way the individual perceives and interprets the test material, or 
"structures" the situation, reflects basic characteristics of his personality. 
The test stimuli thus serve as a sort of screen on which the subject 
"projects" his own ideas. Since the subject is generally unaware of the way 
his responses will be scored or evaluated, projective techniques are less suscepti- 
ble to faking than are self-report inventories, but they are not entirely immune 
to it (e.g., 97). 

A widely used projective technique, which has also served as a model for
the development of many later instruments, is the Thematic Apperception
Test (TAT). First developed by Murray and his associates (60, 61), this test
consists of a set of pictures, for each of which the subject is to make up a 
story. As in most projective tests, there is no objective scoring procedure, the 
stories being interpreted by the examiner in terms of Murray's theory of needs 
and of environmental forces that either facilitate or interfere with the satisfac- 
tion of these needs. Several adaptations of the TAT have been developed for 
special purposes. One of these, prepared for use in assessing managerial 
potential, provides a semiobjective scoring system (see 89, p. 392). 

Fig. 19 Sample item from the Tomkins-Horn Picture Arrangement Test.
The subject indicates the sequence that makes the best sense for the three
pictures and writes three sentences explaining them. (Reproduction by
permission of Springer Publishing Company, Inc.)

Pictures are employed in a different way in the Tomkins-Horn Picture
Arrangement Test (PAT), a projective technique originally designed for the
selection and guidance of industrial personnel (94). Each item of the PAT 
consists of three sketches presented in a round-robin arrangement so as to 
minimize positional set (see Figure 19). The subject’s task is to indicate the 
order of the three pictures “which makes the best sense” and to write a 
sentence for each of the three pictures to tell the story. Most items deal with 
interpersonal relations, more than half of them portraying work situations in 
an industrial setting. Advantages of this test are that it can be group-administered
and machine-scored. The norms are also unusually good for a
projective test, having been obtained on a large, representative sample of the 
United States population. Subjective interpretation has not been entirely 
eliminated, however. After the objective score pattern is found for each sub- 
ject, it must be interpreted by a clinical psychologist, who may also refer to 
the subject’s written material at this stage. Like many other personality tests, 
the PAT has been used in exploratory research on the executive person- 
ality (58). 

In the light of available knowledge regarding their reliability, validity and 
other technical characteristics, projective techniques in general must still be 
regarded as research instruments. None is ready for operational use, especially 
in personnel selection. In some situations, these techniques are employed in a 
relatively subjective manner, as aids to the interviewer. When so used, how- 
ever, they do not function as tests and cannot be evaluated as such. 

Situational Tests. Although the term “situational test” first came into 
prominence during World War II, tests of this type had been developed be- 
fore that time. Essentially a situational test is one that closely resembles or 
simulates a “real-life” criterion situation. In this respect such tests are similar 
to the work-sample trade tests discussed earlier in this chapter. In situational 
tests, however, the criterion behavior that is sampled is more varied and 
complex. Moreover, interest is centered, not on aptitude or achievement, but 
on emotional, social, attitudinal, and other personality traits. 

Several kinds of situational tests have been developed for military use to 
assess the leadership behavior and human-relations skills of officers. Most involve
the sort of face-to-face problem situations encountered by administrators
ner setting. An example of such a problem situation is briefly described
below:


The examinee is evaluated as the commanding officer of a small ship. He must 
discuss with Ensign Baker, the ship's engineering officer, an unsatisfactory per- 
formance report he is about to submit. Although Baker is highly qualified in the 
technical skills required, he lacks experience. He has allowed the enlisted men to 
run his department. This has led to no specific incident in which his department has 
interfered with the ship's operating efficiency, but his behavior has set a bad ex- 
ample for the men and the other officers on board (2, p. 2). 


The subject is instructed to assume the given role and carry out the interview 
with “Ensign Baker,” whose part is played by a trained actor. The examiner 
observes and evaluates the subject’s performance with the aid of a previously 
developed checklist of effective and ineffective behaviors. Examples of such 
behaviors include: Gives reason for meeting; Allows Baker to direct conversation;
Vacillates in, or changes position on, unsatisfactory Report; Mentions
Baker's fine engineering background; Mentions constructive nature of
Fitness Report (2, p. 2). 

A promising type of situational test is the Leaderless Group Discussion
(LGD). Requiring a minimum of equipment and time, this technique has
been widely used in the selection of such personnel as military officers, civil 
service supervisors and administrators, industrial executives, management 
trainees, sales trainees, teachers, and social workers (see 3, pp. 610-612; 6; 40, 
pp. 260-263). In this test, a group of examinees is assigned a topic for dis- 
cussion during a specified period. Examiners observe and rate each person's 
performance but do not participate in the discussion. Although often used 
under informal and unstandardized conditions, the LGD has been subjected 
to considerable research and some objective scoring procedures have been 
worked out for it. 

Validity studies suggest that LGD techniques are among the most effective 
applications of situational tests. Many significant and sizable correlations 
have been reported between ratings from LGD performance and follow-up 
or concurrent ratings obtained in military, industrial, and social settings. Some 
of these correlations are as high as .60. Neither LGD nor other, more elabo-
rate situational tests, however, have proved valid as devices for assessing broad
personality traits. All such tests appear to be most effective when they ap- 
proximate actual work samples of the criterion behavior they are designed to 
predict. The LGD tests in particular have some validity in predicting per- 
formance in jobs requiring a certain amount of verbal communication, verbal 
problem solving, and acceptance by peers. Another factor that seems to in- 
crease the predictive validity of situational tests is job familiarity on the part 
of the raters. Such a finding again suggests that situational tests work best 
when the subject’s performance is interpreted as a work sample rather than in 
terms of underlying personality variables. 


SUMMARY 


Personnel selection may utilize tests of intelligence, special aptitudes, achieve-
ment, or personality. Classification programs are making increasing use of
multiple aptitude batteries, which combine the traits measured by intelli- 
gence tests with some of the broader abilities traditionally covered by special 
aptitude tests. Designed to assess the individual's general intellectual level, 
intelligence tests actually measure largely verbal comprehension and other 
abstract functions required for scholastic achievement. Several short intelli- 
gence tests have been developed for screening industrial personnel. Intelli- 
gence tests may be used to predict performance in training, job proficiency, 
job turnover, and promotion potential. A special contribution of such tests 
is in the job placement of mental defectives. At the other end of the scale, 
research on creative talent has produced many new testing techniques, some 
of which may find their way into the intelligence tests of the future. 

Special aptitudes are often measured by custom-made tests designed for 
a particular job. Among the most common standardized tests for general use
are tests of vision, hearing, simple motor skills, spatial aptitude, mechanical 
comprehension, and clerical aptitude. Achievement tests measure current pro- 
ficiency following relatively uniform training or experience. However, they 
may also serve as predictors of subsequent performance. They include written
or oral information tests and tests of job skills. Many of the latter are standard- 
ized work samples, such as taking dictation, operating a punch press, or 
handling the matters in an executive's in-basket. 

Personality tests may be used with any type of worker but have found 
their greatest application in jobs requiring extensive interpersonal contacts, 
such as selling and executive work. It is especially important in this area to 
guard against common misuses of tests, such as undue reliance on test scores 
in the absence of other information or the use of inadequately validated in- 
struments. The major types of personality tests used in personnel selection 
include interest tests, personality inventories, projective techniques, and situa- 
tional tests. Being among the most successful of all personality tests, interest 
tests rely chiefly on the empirical criterion keying of reported interests in a 
variety of familiar experiences. The readiness with which personality inven-
tories can be faked has led to the development of such techniques as verifica-
tion keys and forced-choice items. While less subject to faking, projective
techniques are still largely unvalidated. Some situational tests have promise,
especially when they closely approximate actual work samples.


Personal Appraisal and Biographical Data in Personnel Selection


Long before the appearance of the first psychological test, job applicants were 
being selected through letters of application, references from past employers,
and personal interviews. Tests have by no means replaced these traditional 
procedures of personnel selection. In fact, as was pointed out in the preceding 
chapter, no reputable industrial psychologist would evaluate an applicant on 
the basis of tests alone. Test scores must be interpreted in the light of other 
information about the individual’s background. Moreover, good tests are not 
available for all traits, especially in the area of personality. A rating obtained 
from a well-conducted interview, or from persons who have observed the ap- 
plicant adequately in previous relevant situations, is preferable to a poor test 
or a test of unknown validity. It is also likely that for certain interpersonal 
traits, involving as they do the reactions of the individual to other persons, 
ratings will always provide the most valid predictors. 

Application forms, references, and interviews may all be used to obtain 
biographical data pertaining to education, special training, job history, and 
other relevant facts of past experience. When properly interpreted, an accurate
record of what the individual has done in the past is an effective predictor
of what he will do in the future. Interviews and references also provide 
interpersonal data based on the interaction of the applicant with the inter- 
viewer and with previous acquaintances, respectively. The present chapter 
is concerned with available ways of improving application forms, reference 
reports, and interviews and with an evaluation of the effectiveness of these 
techniques. But first we shall consider rating procedures in general, since 
they underlie both reference reports and interviewing, besides having other 
important uses in personnel psychology. 


RATING PROCEDURES 


In Chapter 2 we have already seen that ratings constitute an important source 
of criterion data for most jobs. The evaluation of on-the-job performance 
against which predictors can be validated represents one application of rating 
procedures in personnel psychology. An important operational use of rating is 
to be found in the merit rating systems regularly employed by many com- 
panies. Such periodic performance ratings serve as an aid in administrative 
decisions regarding promotions, salary increments, transfers, or discharges. 
They also provide information needed for the improvement of worker perform-
ance through conferences or additional training. In personnel selection, rating
procedures can be effectively incorporated into reference forms and interview 
reports. Although many rating procedures have been developed in the effort 
to make judgments more objective and precise, they can be conveniently 
classified under seven types: order-of-merit comparisons, rating scales, scaled
checklists, forced-choice technique, sociometric or nominating techniques, 
critical incidents, and field review method. 

Order-of-merit Comparisons. In order-of-merit procedures, the rating is 
done by ranking, comparing, or classifying people. Thus a supervisor may be 
asked to rank the 27 employees in his department in order of merit for over- 
all job performance or for any specified trait. A more precise but time-con- 
suming procedure is that of paired comparisons, in which each individual is 
paired in turn with every other and the judge merely indicates which is the 
better in each pair. 
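
The paired-comparison procedure can be sketched in a few lines. The following is only an illustration: the names, the merit figures, and the `prefer` function standing in for the judge are all hypothetical, and ranking by number of pairs won is one common way of summarizing the judgments, not necessarily the only one.

```python
from itertools import combinations

def paired_comparison_ranking(people, prefer):
    """Rank people by the number of pairs in which the judge preferred them."""
    wins = {person: 0 for person in people}
    # Each individual is paired in turn with every other individual.
    for a, b in combinations(people, 2):
        wins[prefer(a, b)] += 1
    ranking = sorted(people, key=lambda p: wins[p], reverse=True)
    return ranking, wins

# Hypothetical judge: prefers whoever has the higher (made-up) merit figure.
merit = {"Adams": 7, "Clark": 4, "Davis": 6, "Evans": 9}
ranking, wins = paired_comparison_ranking(
    list(merit), lambda a, b: a if merit[a] > merit[b] else b
)
print(ranking)

# Number of separate judgments needed for a department of 27 employees.
n = 27
pairs_needed = n * (n - 1) // 2
print(pairs_needed)
```

The count of required judgments grows as n(n - 1)/2, which is why the procedure, though more precise than simple ranking, is so time-consuming for a group of any size.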

Even order-of-merit ranking becomes laborious when the group is large. As a 
result, it may be done carelessly and inaccurately by the busy rater. A less 
demanding variant, which nevertheless yields results of sufficient accuracy for 
most purposes, is the forced distribution. In this procedure, the rater may be 
given a rating scale similar to those to be discussed shortly, but the number of 
persons to be placed in each category is specified—either rigidly or, more 
often, approximately. 

An example of a forced distribution with 40 cases is given in Table 4. The
numbers in each category are derived from normal curve frequencies, on the as- 
sumption that the trait under consideration is normally distributed in the group. 


TABLE 4 A Forced Distribution with 40 Cases


Rating                        A     B     C     D     E
Number of persons to be
assigned each rating          3    10    14    10     3


If this assumption is correct, the trait categories in the scale can be treated 
as equal units of amount. In fact, they have the mathematical properties of 
the previously described standard scores (Chapter 2). Such equality of units 
is a further advantage of the forced distribution as compared with ordinary 
ranking. Ranks cannot be treated as equal units. The difference in job pro- 
ficiency between two employees ranked first and second in a group of 27 is 
likely to be much greater than the differences between those ranked thirteenth 
and fourteenth in the same group. This follows from the fact that in a normal 
distribution individuals cluster more closely in the center and are more widely 
scattered as the extremes are approached. 
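
The inequality of rank units can be made concrete with a rough sketch. It assumes, as the text does, that the trait is normally distributed, and it further assumes a particular convention for turning a rank into a proportion (the midpoint of the rank's slice of the distribution); that convention and the function name `rank_to_z` are illustrative choices, not taken from the source.

```python
from statistics import NormalDist

def rank_to_z(rank, n):
    """Approximate standard score for a rank (1 = best) in a group of n."""
    # Assumed convention: each rank occupies 1/n of the distribution and is
    # represented by the midpoint of its slice.
    proportion_below = (n - rank + 0.5) / n
    return NormalDist().inv_cdf(proportion_below)

n = 27
gap_top = rank_to_z(1, n) - rank_to_z(2, n)
gap_middle = rank_to_z(13, n) - rank_to_z(14, n)

print(f"ranks 1 and 2 differ by   {gap_top:.2f} SD")
print(f"ranks 13 and 14 differ by {gap_middle:.2f} SD")
```

Under these assumptions one step in rank at the top of the group corresponds to a considerably larger difference in standard-score units than one step in the middle.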

All order-of-merit techniques presuppose that the same judge evaluates all 
members of the group on a single occasion. Such a procedure is feasible in 
certain situations, as when a supervisor rates the employees in his department. 
But there are obviously other situations to which this procedure is inapplica-
ble, notably reference forms submitted on individual applicants and interview
reports prepared as each applicant is interviewed. Another limitation of order-
of-merit techniques is that they provide information only on the relative posi-
tion of each individual within a group; they do not permit evaluation in terms
of absolute standards. Thus from order-of-merit data alone it would be im-
possible to determine which individuals if any fall below minimum pro-
ficiency standards required for a job or to compare different groups that have
been independently rated.


1 For a discussion of the statistical problems and procedures pertaining to the use of
ranks and rating scales, see Guilford (29, ch. 11).

Rating Scales. To obtain a record of absolute judgments in terms of an 
external standard, various types of rating scales have been developed. With 
such scales, each individual is independently evaluated. In the preparation of 
all rating scales, the basic problem pertains to the choice of units for express- 
ing degrees of the trait to be rated. These units must be easily understood 
and uniformly interpreted by all raters. 

An early attempt to achieve such intelligibility of units by an ingenious 
adaptation of ranking procedures resulted in the man-to-man rating scale.
First developed by Walter Dill Scott for use in salesman selection, this device 
was later adapted for rating army officers during World War I. The man-to- 
man scale requires a common reference group with which all raters are ac- 
quainted. From this group, the raters choose “scale men” to represent equally 
spaced degrees of the trait, such as lowest, low, average, high, and highest. 
The names of the individual scale men are then used to define these five steps. 
In applying the scale to any individual, the rater simply chooses the scale man 
whom that individual resembles most closely in the trait and assigns him the 
corresponding rating. For instance, if in his leadership behavior the individual 
resembles Henry Brown more closely than he does any of the other four scale 
men, and if Henry Brown was the “high” scale man, then this individual is 
given a rating of “high.” 

The units in a man-to-man scale are the men selected to define the scale 
positions. Such units provide objective and readily comprehensible anchor 
points for the judgment scale. A major difficulty with this technique, however, 
is that the different raters must not only be acquainted with a common refer- 
ence group but must also agree in their choice of scale men from that group. 

A common type of rating scale utilizes descriptive adjectives as its units. For 
example, an individual’s punctuality, accuracy, or sense of humor might be 
rated as very inferior, inferior, average, superior, or very superior. In the effort 
to make the units more meaningful, different sets of adjectives may be sub-
stituted to fit each trait. Thus in the rating of general ability level, the ad-
jectives might be: stupid, dull, mediocre, clever, brilliant. Although still fre-
quently encountered, the descriptive-adjective rating scale is one of the least
satisfactory. Whether general or specific adjectives are employed, different 
raters are likely to interpret them with considerable variation. Not only are 
the terms themselves vague, but each rater may also apply a somewhat differ- 
ent standard in his judgments. Because of differences in their previous experi- 
ences with employee groups, one rater may describe as mediocre the same job 
behavior that another rater considers clever.

Nor should we be misled by the false impression of objectivity and quanti-
fication created by numerical rating scales. Substituting the numbers 1 to 5
for the previously cited descriptive terms in no way clarifies the meaning
of the units. To rate a person "4" in sense of humor is just as vague and
subjective as to say he is "superior" in this regard. Both types of ratings are
subject to the same misunderstandings and judgment errors. The use of finer
units in a numerical scale, moreover, does not necessarily increase the preci-
sion of the ratings. To rate sense of humor on a 100-point scale is probably no
more accurate than to rate it on a 10-point scale. Raters cannot reliably dis-
criminate between, let us say, ratings of 73 and 74 in such a trait. Hence
either they will not use the intervening values or they will use them in a
haphazard and nondiscriminating manner.


[Figure: five boxes of graded descriptions, from least to most dependable;
the text of the first and last boxes is only partly legible.]

  1. Seldom on time. Absent often. Loafs on job. [...]
  2. Occasionally late. Attendance record not too good. Needs constant
     supervision to keep busy. Could work harder.
  3. Attendance record good. May be late on occasion. Steady worker but
     could show more industry. Needs normal supervision.
  4. Rarely late or absent. Works hard without driving. Turns out work
     on time. Requires less than average supervision.
  5. [...] Needs very little supervision. Always gets job done on time.

Fig. 20 A rating-scale item using graded behavioral descriptions. The
rating is recorded by checking the proper box. (From Dooher and Marquis,
15, fig. 10; reproduced by permission of American Management Association.)

A more effective way to make the scale units meaningful is to utilize 
graded behavioral descriptions for the scale units. An example of such a scale 
is given in Figure 20. In this scale, the worker's dependability is rated by
checking the specific description that most nearly represents his typical be- 
havior. If it is desired to combine ratings on several traits, numerical values can 
be assigned to the different units in this scale. Unlike the units of the numeri- 
cal rating scale, however, these numbers are given concrete and uniform mean- 
ing by the behavioral descriptions that define them. 

Since human traits are continuous variables, any assignment of individuals 
to discrete categories is artificial. Raters often report difficulty in placing an 
individual into any one of the given categories, objecting that he seems to 
fall between two categories. This difficulty is obviated in graphic rating scales, 
in which the individual's trait rating may be indicated by checking any posi- 
tion along a continuous line. To ensure uniformity of interpretation by dif- 
ferent raters, it is desirable to "anchor" the graphic rating scale at several 
points by giving behavioral descriptions, as illustrated in Figure 21. The 
raters are told, however, that these behavioral descriptions do not represent 
separate categories but indicate only the direction of the line. This type of 
rating scale thus combines the continuity of graphic ratings with the inter- 
pretive clarity of graded behavioral descriptions. For most purposes, it is the 
most satisfactory type of rating scale. In recording or combining ratings from 
such a scale, it is customary to employ a stencil that divides the line into 
sections. The number of sections is usually between 5 and 10. While leaving 
the rater free to check anywhere he chooses, such a procedure recognizes the 
fact that very small differences in the position of the checkmarks are of no 
significance because of limitations in the discriminative ability of the raters. 
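
The stencil procedure amounts to converting a check-mark position into a section number. A minimal sketch follows; the line length in millimeters, the choice of seven sections, and the function name `stencil_section` are assumptions made for illustration only.

```python
def stencil_section(position, line_length, sections=7):
    """Return the 1-based stencil section in which a check mark falls.

    position and line_length are in the same unit (here, hypothetically,
    millimeters measured from the left end of the line).
    """
    if not 0 <= position <= line_length:
        raise ValueError("check mark lies off the line")
    width = line_length / sections
    # A mark at the extreme right end is counted in the last section.
    return min(int(position / width) + 1, sections)

# Two marks only 2 mm apart on a 140 mm line receive the same recorded value.
first = stencil_section(61.0, 140.0)
second = stencil_section(63.0, 140.0)
print(first, second)
```

Grouping positions in this way leaves the rater free to check anywhere on the line while discarding differences too small to be discriminated reliably.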

INSTRUCTIONS TO SUPERVISORS: Place a check mark (✓) on the
line to the right of each employee's name at that point which you
believe gives the most accurate answer to the following question.
Remember the phrases do not indicate columns.

How does this employee make decisions when faced with non-
routine problems?

Jumps at             Fails to foresee      Considers facts      Makes sound
conclusions.         results of his        and most of his      decisions based
Fails to consider    decisions although    decisions are        on thorough
facts and foresee    he considered         acceptable.          analysis.
results of his       the facts.
decisions.

Fig. 21 A graphic rating scale. (From Dooher and Marquis, 15, p. 110;
reproduced by permission of American Management Association.)


Scaled Checklists. In another type of rating procedure, the rater checks
all statements in a given list that characterize the individual. Although pre-
sented in random order, these statements range from very favorable to very
unfavorable and cover a variety of traits. To find the individual's over-all
rating, the scale values of all statements checked for him are averaged. These
scale values, which do not appear on the rating form, are derived during the
construction of the checklist by a procedure first employed by Thurstone and
his co-workers (75) in developing attitude scales. First, a large number of
descriptive statements about workers are assembled. These statements are then
presented one at a time to individual judges, who sort them into piles (usu-
ally 11), from least to most favorable. The scale value given to each state-
ment is the mean position assigned to that statement by all the judges.

It has been found that different groups of judges, such as foremen, per-
sonnel men, college professors, and students, agree closely in the mean posi-
tion assigned to such statements (76). Within each group, any statement
showing too wide a variability in the positions assigned to it by individual
judges is discarded as ambiguous. The final list of statements is selected so
as to cover a wide range of favorableness and to comprise statements spaced
by approximately equal distances in scale values. In Table 5 will be found
some typical checklist statements, together with their scale values and vari-
ability measures.
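
The construction and scoring of a scaled checklist may be sketched as follows. The scale values in `TABLE_5` are those reported by Uhrbrock; everything else is assumed for illustration: the sixteen pile sorts are invented, the cutoff `max_sd` for discarding an ambiguous statement is hypothetical (the source gives none), and the population form of the standard deviation is a guess at the computation used.

```python
from statistics import mean, pstdev

def scale_statement(pile_positions, max_sd=1.5):
    """Return (scale value x 10, SD x 10, keep?) for one statement.

    pile_positions: the pile (1-11) chosen by each judge.
    max_sd: hypothetical cutoff, in pile units, above which the statement
            would be discarded as ambiguous.
    """
    value = mean(pile_positions)
    spread = pstdev(pile_positions)
    return round(value * 10), round(spread * 10, 2), spread <= max_sd

# Invented sorts of a single statement by 16 judges.
sorts = [7, 7, 6, 8, 7, 7, 6, 8, 7, 7, 7, 6, 8, 7, 7, 7]
scale_value, sd, keep = scale_statement(sorts)
print(scale_value, sd, keep)

# Scale values from Table 5.
TABLE_5 = {
    "Is seriously lacking in judgment": 13,
    "Is poor at following instructions": 25,
    "Is a steady worker": 69,
    "Occasionally develops new ideas": 74,
    "Effectively coordinates departmental activities": 89,
    "Is dynamic leader who stimulates enthusiasm": 109,
}

def overall_rating(checked, scale_values):
    """Average the scale values of the statements checked for one worker."""
    return mean(scale_values[statement] for statement in checked)

rating = overall_rating(
    ["Is a steady worker",
     "Occasionally develops new ideas",
     "Effectively coordinates departmental activities"],
    TABLE_5,
)
print(round(rating, 1))
```

The rater sees only the statements; the scale values enter at the scoring stage, where the checked statements are looked up and averaged.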
TABLE 5 Illustrative Statements from a Scaled Checklist for Rating Workers
(Data from Uhrbrock, 76)


                                                     Scale
Statement                                            Value (a)   SD (b)
Is seriously lacking in judgment                        13         8.45
Is poor at following instructions                       25         6.12
Is a steady worker                                      69         8.57
Occasionally develops new ideas                         74        12.23
Effectively coordinates departmental activities         89        12.18
Is dynamic leader who stimulates enthusiasm            109         2.42


a Statements were sorted into 11 piles by 16 judges. Scale values are mean positions
multiplied by 10 to eliminate decimals.
b Standard deviation of positions assigned by 16 judges, also multiplied by 10.


Forced-choice Technique. In Chapter 3, the forced-choice technique was
described as a procedure for reducing faking on personality inventories. The
same technique has been employed for rating purposes in the effort to mini-
mize certain rater biases, such as the tendency to rate an individual too high
or too low (62). Figure 22 shows two typical forced-choice items from a scale
for rating supervisors and executives. These items are in pentad form, each
containing five statements. The rater is instructed to mark the statement most
descriptive and the one least descriptive of the person. As in all forced-choice
items, the unfavorable statements in each pentad are approximately equated
in undesirability, and the favorable statements are approximately equated in
desirability. These matched statements, however, differ markedly in validity
for the job under consideration.


Read each block carefully. Decide which of the statements is Most Descriptive of the
man as he carries on his job. Then, in the column headed Most, place an X over the letter
which goes with the statement you choose. Next, decide which statement is Least Descriptive
of the man and his job performance, and, in the column headed Least, place an X over the letter
which goes with that statement. If you have any comments about your choices, write them on the
line provided in each block. Answer every block. Only one Most and one Least should be
marked in any block.

First block:
  Doing everything possible to keep costs down.
  Knows when to exercise his authority and when not to.
  Unable to relax after a hard day's work.
  People do not like to work for him.
  Cannot handle several details of his job at the same time.

Second block:
  Sometimes says the right thing at the wrong time.
  Studies work closely for possible improvements.
  Confidently relies on his memory which is not always accurate.
  Gets good results from his people.
  Quick to grasp information passed on to him.

Fig. 22 Two items from a forced-choice rating report. (Reproduced by
permission of Richardson, Bellows, Henry & Co., Inc.)


The statements checked for each individual are scored by recording their
empirically established weights, which are based on their criterion correla-
tions. Such weights, of course, never appear on the rating forms. But insofar
as a rater can detect the difference in job relevance between two equally
desirable statements, he can still deliberately slant his evaluation of a given
individual in either a favorable or an unfavorable direction. For this reason,
the forced-choice technique cannot entirely eliminate rater bias, although it
can probably reduce it in most situations.

Nominating Technique. A promising approach to rating is based on the 
nominating technique. Originally developed in sociometry for investigating 
group structure, this technique is generally used in obtaining peer ratings, 
as when a group of co-workers, students, or officer candidates rate each 
other (48). Such ratings are used regularly in the armed services, where 
they are known as “buddy ratings" (21, 81). They are also being employed 
increasingly in industry. The nominating technique can be applied in any 
group whose members have been together long enough to ensure a necessary 
minimum of acquaintance, as in a factory, office, class, or military unit. Each 
person is asked to choose or nominate the group member with whom he 
would most like to work, eat lunch, spend his free time, or carry out any 
other specified activity. Or he may be asked who would make the best fore-
man, district manager, officer, or group leader. The total number of nomina-
tions received by each person represents his rating. A separate rating may be
found for each question, or all nominations may be added to provide a single 
over-all rating. 
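
Scoring the nominating technique is a matter of counting. The sketch below assumes one nomination per person per question; the group members, the two questions, and the function name `nomination_scores` are hypothetical.

```python
from collections import Counter

def nomination_scores(ballots):
    """Tally nominations.

    ballots: {question: {nominator: nominee}}
    Returns a Counter per question and a single over-all Counter.
    """
    per_question = {
        question: Counter(votes.values()) for question, votes in ballots.items()
    }
    overall = Counter()
    for counts in per_question.values():
        overall.update(counts)
    return per_question, overall

# Invented ballots from a four-man work group.
ballots = {
    "would most like to work with": {
        "Ames": "Cole", "Boyd": "Cole", "Cole": "Ames", "Dunn": "Cole",
    },
    "would make the best foreman": {
        "Ames": "Boyd", "Boyd": "Cole", "Cole": "Boyd", "Dunn": "Boyd",
    },
}
per_question, overall = nomination_scores(ballots)
print(overall.most_common())
```

A member who receives no nominations simply has a count of zero, and the per-question tallies remain available when a separate rating for each question is wanted.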

Peer ratings apparently reflect much more than mere popularity. There 
is considerable evidence from military studies indicating that these ratings 
may be more reliable and valid than supervisors' ratings on the same individ-
uals. Their superiority is particularly evident in ratings of interpersonal traits,
such as leadership (81). Validity studies of peer nominations in industry 
have also yielded promising results. Life insurance agents in one company, 
for example, submitted peer nominations on a 14-item questionnaire, from 
which an overall rating was computed for each individual (80). Without any
knowledge of these ratings, the company subsequently promoted 100 of the 
agents to supervisory jobs. Performance ratings of these men after six months 
on their supervisory jobs were significantly and substantially related to their 
initial peer ratings, as shown in Table 6. The performance ratings were as-
signed by supervisors who had no access to the original peer nominations.
Other studies of the validity of the nominating technique have shown it to 
be a good predictor of several practical criteria dependent upon interpersonal 
relations (see 48). 


TABLE 6 Relation between Peer Nomination Ratings and Subsequent Supervisory Performance
(Data from Weitz, 80)


Peer Nomination    Number    Percentage Rated Good or Excellent
Score              of Men    in Supervisory Performance (a)
10-14                33                    76
4-9                  28                    54
0-3                  39                    39


a The significance of the relationship is indicated by a χ² of 10.08, P < .01.
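
The chi-square value in the footnote can be approximately reconstructed from the table itself. The sketch below assumes that the published percentages correspond to whole numbers of men (25, 15, and 15 rated good or excellent) and that the test was an ordinary chi-square on the resulting 3 x 2 table; neither detail is stated in the source.

```python
from math import exp

# (peer nomination score, number of men, percent rated good or excellent)
rows = [("10-14", 33, 76), ("4-9", 28, 54), ("0-3", 39, 39)]

# Assumption: recover whole-number counts from the rounded percentages.
good = [round(n * pct / 100) for _, n, pct in rows]
rest = [n - g for (_, n, _), g in zip(rows, good)]

total = sum(n for _, n, _ in rows)
chi_square = 0.0
for (_, n, _), g, r in zip(rows, good, rest):
    for observed, column_total in ((g, sum(good)), (r, sum(rest))):
        expected = n * column_total / total
        chi_square += (observed - expected) ** 2 / expected

# A 3 x 2 table has 2 degrees of freedom, for which the chi-square
# tail probability is exactly exp(-x / 2).
p_value = exp(-chi_square / 2)
print(round(chi_square, 2), p_value < 0.01)
```

Under these assumptions the computed value agrees with the reported 10.08, and the associated probability falls below .01.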
Although only applicable to certain types of situations, the nominating
technique has proved to be one of the most dependable of rating techniques. 
These findings are not surprising when we consider some of the features of 
peer nominations. First, the number of raters is large, including all group 
members. Second, an individual's peers are often in a particularly favorable 
position to observe his typical behavior. They may be better judges of certain 
interpersonal traits than foremen, superior officers, instructors, or other outside 
observers. Third, and probably most important, is the fact that the opinions 
of group members—right or wrong—influence their actions and hence partly 
determine the nature of the individual's subsequent interactions with the 
group. Other comparable groups may be expected to react in a similar way 
toward the individual. Peer ratings thus have the same kind of validity as 
work samples. 

Critical Incidents. The critical-incident technique, discussed in Chapter 
2 as a means of gathering job-analysis data, has also been adapted for the 
rating of individual workers. A specific example of this use of critical incidents 
is provided by The Performance Record, developed by Flanagan and his asso- 
ciates (22, 23). Separate forms are available for evaluating the performance 
of hourly workers, nonsupervisory salaried employees, and foremen and 
supervisors. In each case, the supervisor records on a specially prepared record 
sheet all actions by the employee that represent either satisfactory or unsatis- 
factory critical job behavior. Categories are provided for classifying these 
actions under such headings as Judgment and Comprehension, Improving 
Equipment and Showing Inventiveness, Dependability, Accepting Supervision 
and Organizational Procedures, and Getting Along with Others. 

Although no total score or over-all rating is obtained from The Performance 
Record, the number of good and bad incidents under each category may be 
separately counted and these figures, together with specific incidents, used as 
a basis for employee conferences. Evaluation of this rating technique must 
await the gathering of reliability, validity, and other research data, which are 
almost entirely lacking at present. It undoubtedly has practical advantages in 
providing a fairly objective and systematic foundation for qualitative evalua- 
tion of employee performance. 

Field Review Method. Another procedure designed chiefly to improve 
qualitative evaluations of employee performance is the field review method 
(15, pp. 164-165). In this method, a consulting psychologist or a member of
the personnel department interviews each supervisor regarding the perform- 
ance of employees in his department. Although the discussion is kept infor- 
mal, the interviewer sees to it that certain points are covered and adequately 
followed up in the course of the interview. Special efforts are made to elicit 
supporting facts for each statement, such as the specific ways in which one 
employee's performance is outstanding, the particular weaknesses another has 
exhibited in his relations with co-workers, and the like. 

On the basis of such an interview, the personnel worker may rate the 
over-all job effectiveness of the employee into rough categories, such as defi- 
nitely outstanding, satisfactory, or definitely a problem. These ratings, for 
example, could serve as a criterion measure if more precise quantitative 
ratings are unobtainable. Frequently, however, the object of the field review
method is merely to stimulate the supervisor to evaluate the employee more 
objectively and to plan appropriate action. 
FACTORS INFLUENCING THE ACCURACY OF RATINGS


Like other predictors, ratings can be evaluated in terms of reliability and 
validity. Reliability of ratings covers temporal stability as well as interrater 
agreement. The former may be found by comparing the ratings assigned by 
a single judge to the same individuals on different occasions, the latter by 
comparing the ratings assigned by different judges to the same individuals. 
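
Both comparisons are commonly expressed as correlation coefficients, and the sketch below takes that approach as an assumption, since the source does not name the index. The ratings are invented, and `pearson_r` is a plain helper written out for the purpose.

```python
from math import sqrt

def pearson_r(x, y):
    """Product-moment correlation between two equally long lists of ratings."""
    n = len(x)
    mean_x, mean_y = sum(x) / n, sum(y) / n
    sxy = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    sxx = sum((a - mean_x) ** 2 for a in x)
    syy = sum((b - mean_y) ** 2 for b in y)
    return sxy / sqrt(sxx * syy)

# Invented ratings of the same six employees on a 5-point scale.
judge_a_first = [4, 2, 5, 3, 1, 4]    # Judge A, first occasion
judge_a_second = [4, 3, 5, 3, 1, 5]   # Judge A, later occasion
judge_b_first = [3, 2, 5, 4, 2, 3]    # Judge B, first occasion

temporal_stability = pearson_r(judge_a_first, judge_a_second)
interrater_agreement = pearson_r(judge_a_first, judge_b_first)
print(f"temporal stability:   {temporal_stability:.2f}")
print(f"interrater agreement: {interrater_agreement:.2f}")
```

The first coefficient compares one judge with himself over time; the second compares two judges on the same occasion. High values on both say nothing, by themselves, about validity.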

The validity of ratings has been investigated much less frequently than 
their reliability, probably because of the difficulty of finding adequate cri- 
terion measures. It cannot be assumed, however, that high rating reliability 
implies high validity. Raters may be highly consistent over time and may 
agree closely with each other while subject to common errors and biases which 
reduce validity. For the same reason, the use of a consensus or average rating 
by a group of judges as a criterion does not provide conclusive evidence of 
validity, except for such traits as popularity or leadership, which are defined 
largely in terms of social interaction. For most traits, follow-up data on the 
subject’s behavior in relevant situations represent the most satisfactory criteria 
against which to validate ratings. This is particularly true when ratings are 
used as predictors of future proficiency, in connection with selection or classi- 
fication decisions. 

Constant Errors of Judgment. Unlike variable errors, which tend to
cancel out when the number of observers is increased, constant errors tend to
occur in the same direction for all observers. One of the most common sources 
of rating errors is the halo effect, first empirically demonstrated and named 
by Thorndike (73) in 1920. The name was suggested by the fact that a single 
trait in which an individual is conspicuously superior or inferior may cast a 
“halo” over his other traits and thus raise or lower the ratings assigned to 
him on all traits. As a result the intercorrelations of ratings on all traits tend 
to be uniformly high. For example, in an analysis of the ratings assigned to 
over one thousand employees in a single plant, intercorrelations of ratings 
for such traits as health, personality, versatility, and accuracy ranged from 
.50 to .84 (18). When objective measures are available, such as test scores, 
production figures, or accident records, the intercorrelations of these measures 
are generally much lower than the intercorrelations of the ratings for the 
corresponding traits. 

One way to reduce the halo effect is to define traits very specifically, in 
terms of concrete behavior rather than vague general concepts. In graphic 
rating scales, it is advisable to reverse the favorable and unfavorable ends of 
the line in random order for different traits (12, 26). This device serves to 
emphasize the distinctness of each trait and forces the rater to read the de- 
scriptive statements under each line rather than marking a similar position 
indiscriminately for all traits. When several subjects are to be rated by the 
same judge, a recommended procedure is to rate all subjects in a single trait 
before rating anyone on the next trait. Thus all subjects are first rated in 
cooperativeness, then all are rated in punctuality, and so on. This procedure 
tends to focus the rater's attention on each trait separately rather than on his 
general impression of each individual. Although experimental evidence for 
the effectiveness of this technique is meager and inconclusive (see 40), more 
thorough empirical investigation of its possible usefulness in reducing halo 
effect would be desirable. 

Another constant error, known as the error of central tendency, refers to 
judges' avoidance of extreme scale positions and their tendency to place an 
excessive number of persons in the center of the scale. This error is eliminated 
in forced-distribution and other order-of-merit procedures. Explaining the 
purpose of the rating so that judges are motivated to identify rather than to 
conceal individual differences helps to reduce the error. It should be noted, 
however, that, if judges do not have an adequate basis for making the re- 
quired discriminations, encouraging them to use the ends of the scale leads 
to overdifferentiation and increases the over-all amount of judgment error. In 
such a situation, reduction in range is desirable and corresponds to the re- 
gression toward the mean that occurs in statistical prediction. 

The leniency error denotes the reluctance of many raters to assign unfavor- 
able ratings, especially when ratings are used for administrative purposes. The 
ratings actually given may thus range from average to excellent, or they may 
even cluster entirely near the upper end of the scale. Like the error of cen- 
tral tendency, the leniency error reduces the effective size of the scale and 
makes ratings less discriminative. It, too, can be eliminated through the use 
of a forced distribution. Providing more favorable than unfavorable terms in 
the scale is another way to reduce this error. Judges may be less reluctant to 
characterize an individual as “only slightly cooperative” than to call him “un- 
cooperative.” The forced-choice technique and some of the other special rating 
procedures described in the preceding section were designed partly to avoid 
the leniency error. There is evidence that the forced-choice technique reduces 
but does not entirely eliminate this error (30, p. 145; 72). Under certain 
conditions, moreover, graphic rating scales may be just as satisfactory as 
forced-choice procedures in this regard (7). 

Constant errors of judgment may also arise from social stereotypes. Con- 
trolled experiments have demonstrated, for instance, that ratings for such 
traits as intelligence, dependability, industriousness, talkativeness, and con- 
scientiousness may be influenced by the wearing of spectacles or lipstick (49, 
74). Other stereotypes stemming from folklore or from the pseudoscientific 
systems of phrenology and physiognomy may also affect ratings in many 
psychological traits (64). The association of a high forehead with intelligence, 
a square jaw with determination, or a steady look with honesty are familiar 
examples. Such stereotypes may influence ratings even when the rater is un- 
aware of their operation. What the rater accepts as a hunch or general im- 
pression may in fact stem from an unrecognized response to some irrelevant 
physical cue. Judgment errors resulting from social stereotypes tend to be most 
prevalent in ratings based on a brief interview or other initial contact. The less 
behavioral information the rater has about the individual, the more likely 
is he to rely upon stereotypes and snap judgments. 

Amount of Acquaintance. It is obvious that a certain minimum of ac- 
quaintance with the person to be rated is an essential prerequisite for accurate 
rating. In the absence of concrete knowledge about the subject’s behavior, 
the rater tends to be misled by irrelevant characteristics and to invoke clichés, 
generalities, and stereotypes. That increasing amount of acquaintance im- 
proves both reliability and validity of ratings was illustrated by an analysis of 
the ratings assigned to 1,597 assistant managers in an insurance company by 
traveling field representatives (20). Studies of buddy ratings obtained during 
officer candidate training, on the other hand, suggest that beyond a certain 
point longer acquaintance may have little value (37, 81). Peer nominations 
made early in the training period closely approximated later ratings obtained 
by the same procedure. Under these conditions, the close and extensive con- 
tacts among the subjects and their many opportunities to observe relevant 
behavior undoubtedly contributed to the accuracy of early ratings. 

A useful distinction between personal acquaintance and “trait acquaintance” 
was proposed as early as 1922 by Slawson (65). It is not how long the rater 
has known the subject that matters so much as the opportunities he has had 
to observe the trait being rated. A rater, for example, may have known an 
individual for ten years and yet may never have seen him in situations where 
ability to work with a group could be manifested. In an investigation of 
ratings assigned to teachers by their supervisors, Slawson (65) found higher 
interrater reliability for those traits for which more extensive objective evidence 
was available to the raters. Moreover, the relative rater reliability of different 
traits varied from school to school, depending upon the opportunities for 
observation provided by local circumstances. A more recent illustration is to 
be found in a study of supervisory ratings of clerical employees, which re- 
vealed higher rater agreement and less leniency error in the ratings of such 
“job” traits as amount of work done and ability to do complicated jobs than 
in the ratings of less readily observable “personal” traits such as conscientious- 
ness and ability to work with others (71). When these personal traits were 
more fully defined in terms of concrete job behavior, the difference in rater 
reliability between the two sets of traits disappeared. 

Increase in duration of personal acquaintance, without a corresponding 
increase in opportunities to observe the trait rated, may actually reduce the 
accuracy of ratings. Both halo effect and leniency error, for instance, tend to 
increase with very long acquaintance (45, 65). An effective technique to 
ensure that the judge will rate only traits for which he has trait acquaintance 
with the ratee is to include after each trait a space to be marked when there 
was no opportunity to observe, as illustrated in Figure 23. Apparently, when 
raters have inadequate information, they are more willing to mark such a 
designated space than to leave the trait unmarked. A request for supporting 
evidence or specific examples of behavior is a further way of encouraging 
judges to rate only those traits they have actually observed. 

Rater Characteristics. Ratings may vary in a number of ways as a func- 
tion of the rater himself. Some raters are characteristically “hard,” others 
characteristically “easy.” Thus the same group of subjects may cluster at 
different parts of the scale when evaluated by different judges, even if the 
relative position assigned to each subject remains approximately the same. 
Raters may also differ in the spread, or variability, of their ratings, some 
utilizing only a narrow portion of the scale, others spreading their ratings 
over a much wider range. 

Rater biases in mean and variability can be eliminated by converting all 
ratings to standard scores (see Chapter 2). This procedure will have the effect 
of expressing the ratings by all judges in terms of a common mean and SD. 
Such a conversion is needed when we wish to average the ratings assigned by 
different judges to the same individuals. If the various judges have rated 
different groups, however, it should be borne in mind that the use of standard 
scores will serve to eliminate any real differences in level or variability from 
one group to another. The operation of rater biases can be somewhat reduced 
during the rating process itself by defining the traits to be rated fully and 
concretely and by specifying the standards in terms of which subjects are to 
be rated. 

DIRECTIONS: Place a check mark (✓) on the line under each of the 
following traits. Note that each trait is defined, and it is this defi- 
nition that should be used in rating rather than your own idea of 
what the terms mean. Choose the point on each line most nearly 
fitting the student. If you have had no opportunity to observe a 
particular trait, check the section marked “not observed.” 

SELF-EXPRESSION (Skill in Communication): Ability to manipulate 
language symbols and to formulate thought for effective com- 
munication. 

Scale positions, from left to right: 
  Expresses self extremely well. Formulates and communicates thoughts clearly. 
  Good self-expression. Usually gets his ideas across. 
  Average ability to express ideas. Partly lacking in effective communication. 
  Poor self-expression. Has difficulty in making himself understood. 
  Noticeably poor in ability to communicate ideas. 
  Not observed. 

INDUSTRY: Application to work and responsibility; effort expended 
to get work done. 

Scale positions, from left to right: 
  Applies himself thoroughly and energetically to all tasks. 
  Applies himself to the extent necessary. Diligent worker. 
  Average industry. Sometimes shirks work. 
  Poor attitude toward responsibilities. 
  Never does today what he can put off till tomorrow. 
  Not observed. 

Fig. 23 Two items from a graphic rating scale with provision for lack of 
trait acquaintance. (Adapted from Student Rating Scale, Fordham College.) 

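
The standard-score conversion described above can be sketched as follows. This is a minimal illustration under stated assumptions: the two judges and their ratings are hypothetical, and the population standard deviation (statistics.pstdev) is used, although the text does not say which form of the SD is intended.

```python
from statistics import mean, pstdev


def to_standard_scores(ratings):
    """Express one judge's ratings as standard (z) scores: (rating - mean) / SD."""
    m = mean(ratings)
    sd = pstdev(ratings)  # population SD; an assumption, not specified in the text
    if sd == 0:
        raise ValueError("judge gave identical ratings; standard scores are undefined")
    return [(r - m) / sd for r in ratings]


# Hypothetical ratings of the same five subjects by two judges.
hard_judge = [2, 3, 4, 3, 3]  # low mean, narrow spread
easy_judge = [5, 7, 9, 7, 7]  # high mean, wider spread

z_hard = to_standard_scores(hard_judge)
z_easy = to_standard_scores(easy_judge)

# Once both sets share a common mean (0) and SD (1), they can be averaged.
averaged = [(a + b) / 2 for a, b in zip(z_hard, z_easy)]
print([round(z, 2) for z in averaged])
```

In this example the two judges order the subjects identically, so their standard scores coincide once differences in level and spread are removed. The caution stated above still applies: if the judges had rated different groups, the same conversion would also erase any real differences between those groups.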
Raters also vary in the relative positions they assign to particular individuals. 
Rater A may regard John Brown as much more dominant than Elmer Green, 
while Rater B holds the opposite view with equal conviction. To some extent 
these differences may reflect idiosyncrasies of the raters, arising from their 
past experience, their differing concepts of the traits, and similar factors. Some 
of the variation, however, may result from the fact that the subjects have been 
observed in different situations by the two raters. Even in similar situations, 
the raters’ own manner, appearance, reputation, and other characteristics may 
evoke different behavior from the same individual. Thus some of the variation 
among the ratings assigned by different judges may reflect genuine differences 
in the behavior of the subject under different circumstances. 

The relation between raters’ characteristics and their ability to judge others 
has been extensively investigated (see 30, pp. 149-152; 69). Accuracy of 
judgment is positively related to intelligence and academic ability. Satisfactory 
emotional adjustment is likewise associated with good judging ability, prob- 
ably because the poorly adjusted rater allows emotional biases to affect his 
judgments. The good judge also tends to be more tough-minded, impersonal, 
and socially detached. It is partly for this reason, perhaps, that some studies 
have found physical scientists and experimental psychologists to excel clinical 
psychologists in judging others. A clinical, counseling, or social service orienta- 
tion may be a handicap in the objective evaluation of an individual's assets 
and liabilities. 

While much more research is needed to spell out all the qualifications of 
a good rater and the reasons for such qualifications, it is clear that differences 
among raters are large and significant. A number of experiments using a variety 
of rating procedures agree in one conclusion: Who does the rating makes 
more difference in the ratings received than do the characteristics of the rating 
scale, the rating technique employed, or the conditions under which ratings 
are made (4, 6, 21). 

Training. Much of what has been said regarding factors that influence 
the accuracy of ratings suggests the desirability of training raters. Research in 
both industrial and military settings has demonstrated the effectiveness of 
training in increasing the validity of ratings and in reducing such judgment 
errors as halo effect and leniency error (10). Training programs are especially 
appropriate for supervisors who must assign periodic merit ratings as part of 
their regular duties. They are also helpful for selection interviewers who must 
rate applicants and for anyone who is asked to provide criterion ratings. 

Rater training programs vary in duration and thoroughness depending upon 
their purposes and the available facilities (10, 16). Even a relatively simple 
orientation session lasting only an hour or two will produce noticeable results. 
In general, such a training program should explain the aims and purposes of 
the ratings, provide information on common judgment errors such as halo effect 
and stereotypes, stress the importance of obtaining maximal differentiation and 
avoiding both leniency and central tendency errors, clarify and illustrate the 
meaning of the traits to be rated and of the scale units, and give supervised 
practice in the assignment of ratings. 


APPLICATION FORMS 


One of the best-known procedures of personnel selection is the evaluation 
of candidates through letters of application or application forms. The tradi- 
tional letter of application stating the applicant’s availability and qualifications 
for the job is subject to many errors. Owing to inadequate knowledge of job 
requirements, the applicant may be unable to determine what personal charac- 
teristics or background facts are most relevant to the job. Moreover, adverse 
factors are not likely to be included and qualifications may be deliberately 
falsified. Characteristics of the letter itself, such as neatness, handwriting, 
grammatical or spelling errors, organization of content, and effectiveness of 
expression may influence the reader in many ways. Of course, when the job 
duties include extensive letter writing, the letter of application may be re- 
garded as a job sample. But in many jobs, such skills are quite irrelevant. If a 
photograph is included for jobs in which appearance is of little relevance, 
physical stereotypes may further distort judgment. 

Application forms, or biographical inventories, ensure more uniformity in 
the facts ascertained, since the employer determines what items are to be 
included. The information requested generally deals with age, marital status, 
number of dependents, previous job history, and education. Club member- 
ships, hobbies, and other social and recreational activities are often covered. 
Also relevant for many jobs are data on present and previous financial position, 
home-ownership, and amount of insurance carried. The last-mentioned item, 
for instance, has been found to be specifically related to success in selling life 
insurance (3). It was shown that this relation could not be attributed simply 
to differences in income level among salesmen. Apparently the salesman who 
believes strongly enough in the value of what he is selling to buy it himself 
is better able to convince clients of its merits. Much of the information 
regarding domestic and financial condition is obtained because of the clues it 
may provide to the applicant's stability, responsibility, and maturity of judg- 
ment. Similarly, social and recreational activities may be related to personality 
characteristics. 

Because of the high correlation between educational achievement and in- 
telligence test scores (see Chapter 3), educational data are effective predictors 
of performance in many jobs, especially when promotion potential is of 
interest. In addition, extracurricular activities may furnish information about 
the individual’s leadership qualifications and other social traits. A study of the 
relation between salary and academic grades, conducted on nearly four 
thousand employees of a large company, provides relevant data (13). Not 
only was salary positively related to average grade earned in college, but this 
relationship was higher the longer the employees had been out of college. 
Thus college grades made relatively little difference in entry salary, but they 
were substantially related to advancement in the company and ultimate job 
level. A similar though less pronounced relation was found when salary was 
compared with extent and level of extracurricular participation in college. 

That both extracurricular activities and scholastic achievement in college 
are significantly related to later job success was also demonstrated in another 
investigation, based on a questionnaire survey of a college class 30 years after 
graduation (39). Those who had excelled as undergraduates, whether in 
grades, sports, student government, or other college activities, had achieved 
far beyond average success at middle age. These relationships were found in 
both directions, i.e., by examining the subsequent vocational status of out- 
standing undergraduates or by checking the undergraduate performance of 
vocationally successful men. 

It is evident that application forms, by providing information about relevant 
biographical data, can serve as effective aids in personnel selection. But the 
relevance of items and the significance of specific responses must be em- 
pirically determined. Otherwise, the application form may prove wasteful 
and misleading. In developing an application form, the personnel psychologist 
is guided in his original choice of items by available research on the relation- 
ships between biographical data and job criteria, as illustrated by the studies 
cited above. A further, more specific guide is provided by a job analysis of the 
job under consideration. The final selection of items, however, can be done 
only through an empirical validation of specific responses against a criterion 
of job success. This process is similar to test validation. It includes the selection 
and weighting of items in one group of employees and a cross validation of the 
complete application form in another group (see Chapter 2). However 
relevant a biographical item may seem for a particular job, its validity cannot 
be assumed without such empirical verification. 

The procedure followed in developing a weighted application blank is 
illustrated in Table 7, which gives some of the items from an application 
blank used at Yale University for selecting clerical and secretarial employees 
(24, pp. 30-36; 25). In an effort to reduce turnover, items were chosen against 
a criterion of job stability. The subjects were 120 women, all of whom had 
been hired as permanent employees. Half of these employees had been on the 
job from two to four years and were still on it (“long-term group”); the other 
half had remained on the job less than two years, most of them having left 
within their first year (“short-term group"). Weights were assigned on the 
basis of the percentages of long-term and short-term employees who gave each 
response. It will be seen in Table 7, for instance, that living within the city 
was much more frequent in the long-tenure than in the short-tenure group. 
Accordingly, this response received a weight of +2, while living in the 
outlying suburbs received a weight of -2. Similarly, the long-tenure employee 
in this particular group of jobs was more likely to be in the older age brackets; 
if she had children, they were of high school age or older. Previous salary, 
on the other hand, failed to differentiate between the two tenure groups and 
thus received zero weights (i.e., it was omitted from scoring). 

All other items on the blank were evaluated in the same way. An applicant's 
score on the entire blank is the algebraic sum of all item weights. Cross 
validation of this weighted application form in a second sample of 85 employees 
yielded a correlation of +.51 with job tenure. In this sample, the average score 
of the short-tenure group was -0.7, that of the long-tenure group 6.3. 

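
The scoring rule just described, in which an applicant's score is the algebraic sum of the item weights, can be sketched as follows. The weights are a subset of those shown in Table 7; the item keys, the two applicants, and the decision to score any unlisted response as zero are assumptions made only for illustration.

```python
# Subset of item weights as read from Table 7 (Fleishman and Berniger).
# Dictionary keys and response labels are hypothetical conveniences.
ITEM_WEIGHTS = {
    "local_address": {"within city": 2, "outlying suburbs": -2},
    "age": {"under 20": -3, "35 and over": 3},
    "previous_salary": {"under $2,000": 0, "$2,000-3,000": 0},
    "age_of_children": {"preschool": -3, "high school or older": 3},
}


def score_blank(responses, weights=ITEM_WEIGHTS):
    """Return the algebraic sum of the weights for an applicant's responses.

    Assumption: an item or response with no listed weight contributes zero.
    """
    total = 0
    for item, answer in responses.items():
        total += weights.get(item, {}).get(answer, 0)
    return total


# Two hypothetical applicants.
applicant_a = {
    "local_address": "within city",
    "age": "35 and over",
    "previous_salary": "$2,000-3,000",
    "age_of_children": "high school or older",
}
applicant_b = {
    "local_address": "outlying suburbs",
    "age": "under 20",
    "previous_salary": "under $2,000",
    "age_of_children": "preschool",
}

print(score_blank(applicant_a), score_blank(applicant_b))
```

Because previous salary carries zero weight at every level, it has no effect on either total, which is what omitting it from scoring amounts to.
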
Although for most purposes the above procedure for finding item weights 
is probably adequate, more refined techniques can be used when the amount 
and nature of available data justify them. Each item response, for example, 
may be correlated with the criterion. The weights are then assigned according 
to the size and statistical significance of these correlations. Another procedure, 
known as the “horizontal per cent method,” uses as the weight the percentage 
of employees giving each answer who are in the successful category (68, pp. 
256-257). Suppose, for example, that 80 employees report high school 
graduation and that within this group 48 are successful and 32 unsuccessful 
on the job. The percentage of successful employees giving this answer is 
thus 60 (48/80 × 100 = 60). Rounded to one digit, the weight assigned to high 
school graduation is +6. 
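
The arithmetic of the horizontal per cent method can be sketched in a few lines. The function name is hypothetical, and reading "rounded to one digit" as dividing the percentage by ten and rounding to the nearest whole number is an assumption that happens to agree with the worked example above.

```python
def horizontal_percent_weight(successful, unsuccessful):
    """Weight for one response under the horizontal per cent method.

    The weight is the percentage of employees giving this response who are
    in the successful category, reduced to a single digit (assumed here to
    mean dividing by 10 and rounding to the nearest integer).
    """
    total = successful + unsuccessful
    if total == 0:
        raise ValueError("no employees gave this response")
    percent_successful = 100 * successful / total
    # Note: Python's round() sends exact halves to the nearest even integer.
    return round(percent_successful / 10)


# The example from the text: 80 high school graduates, 48 successful, 32 not.
weight = horizontal_percent_weight(48, 32)
print(weight)
```

With 48 successful employees out of 80, the percentage is 60 and the resulting weight is 6.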
TABLE 7 Determining Item Weights for a Weighted Application Blank 
(From Fleishman and Berniger, 25, p. 65. Reproduced by permission 
of American Management Association and Personnel) 

                                  Percentage of 
Application-blank          Short-tenure   Long-tenure   Weight Assigned 
Items                      Group          Group         to Response 

Local address 
  Within city                   39             62             +2 
  Outlying suburbs              50             36             -2 
Age 
  Under 20                      35              8             -3 
  21-25                         38             32             -1 
  26-30                          8              2             -1 
  31-35                          7             10              0 
  35 and over                   11             48             +3 
Previous salary 
  Under $2,000                  31             30              0 
  $2,000-3,000                  41             38              0 
  $3,000-4,000                  13             12              0 
  Over $4,000                    4              4              0 
Age of children 
  Preschool                     12              4             -3 
  Public school                 53             33             -3 
  High school or older          35             63             +3 


Beginning in the early 1920s with a form for the selection of salesmen (28), 
weighted application blanks have been developed for many types of jobs. One 
of the most extensive current users of this technique is the Life Insurance 
Agency Management Association, which has developed a system of scoring 
weights for evaluating biographical data of life insurance salesmen. Among 
the jobs for which weighted application blanks have proved valid are those 
of salesmen, salesclerks, and sales managers (34, 42, 47, 53, 57, 63); clerical, 
secretarial, and other office workers (25, 43, 46, 52, 78); chemists and engi- 
neers (36); seasonal production workers (17); service station dealers (67); 
and high-level executives (44). The technique has yielded promising results 
in the identification of research competence and scientific creativity (66, 70). 
It has been effectively employed during and after World War II in the selec- 
tion of army officers, air force pilots, and other military personnel (31, 61). 
Exploratory studies have also been conducted on its possible uses with college 
students (1, 9) and with ROTC cadets (35). 

While the weighted application blank, or biographical inventory, has proved 
successful in a wide variety of situations, certain cautions should be observed 
in its development and use. First, the validity of items is specific to the job. 
A personal or background characteristic that is favorable for one job may be 
irrelevant or unfavorable for another. Such specificity may extend to very 
similar jobs, such as selling different products. It may also hold for different 
criteria within the same job. For example, different scoring weights may be 
required to predict earnings, turnover, accident rate, or promotion potential 
for the same job. 




It is also important to check for possible nonlinear relationships between 
items and criterion. In some jobs, for instance, maximal proficiency may be 
associated with the intermediate ages or educational levels. In such cases, 
lower or negative weights would be assigned to both extremes, while the 
intermediate values would receive the highest positive weights. This relation- 
ship is similar to that found between intelligence or aptitude test scores and 
success in certain jobs, as discussed in Chapter 3. Still another point to check 
is preselection. If applicants have already been screened on the basis of any 
of the characteristics included in the blank, a comparison of successful and 
unsuccessful workers among those hired may fail to reveal any validity for 
this characteristic (58). For example, when only high school graduates are 
hired, amount of education may show no relation to later job success of 
employees. But if education is in fact related to success on this job, dropping 
this item will worsen rather than improve the selection process. There are 
several ways of handling preselection, depending upon specific circumstances. 
But in no case can we assume either the absence of preselection or the 
irrelevance of preselection items. 


REFERENCE REPORTS 


Another common selection procedure is to investigate the applicant's “refer- 
ences.” This involves essentially the obtaining of information about the 
applicant from persons acquainted with him. Such persons are usually former 
employers, although teachers, co-workers, and personal acquaintances may 
also be included. A major use of references is to verify the job experience 
claimed by the applicant. It is also expected that the applicant's performance 
in previous jobs or other situations will be predictive of his performance in the 
prospective job. Since this second objective requires the identification and 
appraisal of relevant behavior, it presents a much more difficult task. 

The reference may be in the form of a letter of recommendation, in which 
the writer is left free to choose the content. It has been aptly said of such 
letters that “they are often sealed with a shrug and opened with a smile. The 
letter may be only one way of speeding the parting guest. The enthusiasm of 
the writer may indicate only his joy over a separation long overdue.”² This 
criticism is particularly applicable to the open letter given to the applicant upon 
termination. Not being confidential, it is likely to contain only favorable state- 
ments. It will also tend to be vague and general, since the writer has no knowl- 
edge of the specific jobs for which it will eventually be used. 

A letter mailed directly to the prospective employer has the advantage of 
confidentiality. It can also be made more relevant to the prospective job, since 
at least the job title and the company are known to the writer. Even under 
these conditions, however, the writer may have little familiarity with actual 
job requirements. A still better type of letter is one written in response to an 
inquiry from the prospective employer. In such an inquiry, the nature of the 
job can be further specified and relevant items of information can be requested. 

2 Quoted by Poffenberger (59, p. 238). 

From the third type of letter it is a short step to an employment recommenda- 
tion questionnaire (ERQ). This type of questionnaire is coming to be used more 
and more in place of the open-ended testimonial letter and narrative recom- 
mendation (54). Recommendation questionnaires can be developed by the same 
procedures as weighted application blanks. Items can be chosen on the basis 
of their empirical validities in predicting any desired criterion. The responses 
can be scored in terms of empirically established weights, and a total score 
on the entire form can be computed. Insofar as certain items may call for the 
rating of the applicant’s previous job performance, abilities, or personality 
traits, rating procedures can be incorporated into the form. The previously 
discussed graphic rating scale and the forced-choice technique are especially 
suitable for this purpose. 

Unlike application blanks, however, recommendation forms have so far 
been subjected to little research. Although widely used, they usually consist 
of subjectively chosen items of unverified validity. Either the responses are 
examined qualitatively, or crude scoring weights are assigned to them in terms 
of their "face value.” Under these conditions, it is not surprising that follow-up 
studies have revealed little or no validity for the personnel evaluations based on 
such forms. 

Among the few research publications dealing with recommendation forms 
is a series of studies conducted in the Federal civil service (27, 55, 56). In 
one of these studies (55), the ERQs of 1,193 employees in 12 skilled trades 
were analyzed. Standard recommendation forms had been mailed prior to 
employment to the references listed on each man’s application blank. An 
average of about four ERQs were sent out for each applicant. Of these, 56 
per cent returned completed, 23 per cent returned incomplete, 18 per cent 
failed to return, and 3 per cent returned unopened. These figures are described 
as typical of the return rate of Federal ERQs. Total scores for each applicant, 
found by applying “rational” rather than empirical weights to the ERQ items, 
yielded low and generally insignificant correlations with a criterion of sub- 
sequent job performance based on supervisory ratings. An analysis of separate 
items on the ERQ showed poor discriminative power, with heavy concentration 
of responses on the favorable answers. 

Another technique for investigating references is the telephone checkup. 
This procedure has several practical advantages over mailed questionnaires. It 
is likely to yield a much larger proportion of completed inquiries. Respondents 
are generally less reluctant to give a frank and full evaluation of the applicant 
orally than in writing. Through skillfully worded questions, proper sequence 
of items, and other devices, the trained telephone interviewer can do much to 
reduce suggestion, halo effect, and other judgment errors. Frequently he can 
also pick up clues in the respondent's remarks, tone of voice, inflection, hesita- 
tion, or other expressive behavior that suggest the need for further probing. 
As a result, he may elicit important facts that would not have been provided 
spontaneously or in answer to routine questions. 

A more time-consuming but effective procedure is the full-scale field in- 
vestigation. Relying on face-to-face interviews with several persons who know 
the applicant, these investigations can provide the most comprehensive and 
unbiased record of past performance. They generally cover not only job 
proficiency but also other information bearing on abilities, personality, and 
character. In one of the previously cited civil service studies (56), field 
investigation findings on 109 applicants for three professional positions (econo- 
mist, budget examiner, and training officer) were compared with the ERQ 
scores for the same persons. Although significant correlations were obtained
for two of the three positions, none of the correlations was high. In several
cases, moreover, the ERQ failed to detect extremely disqualifying features
revealed by the field investigation. This failure may have resulted in part from
a selective bias in ERQ returns. Persons who have adverse information about
an applicant and cannot recommend him favorably often fail to respond at 
all to a written inquiry. To determine how much confidence can be placed 
in field investigation data, however, the investigator should also evaluate the 
respondent in the course of the interview, checking on his accuracy and 
dependability as well as on any possible biases toward the applicant. 


INTERVIEWING 


Interviewing techniques are used for many purposes in journalism, law, medi-
cine, social work, clinical psychology, counseling, public opinion polling, and
market research. Within personnel psychology, interviews are commonly 
employed not only as selection procedures but also in conducting employee 
attitude surveys, as a means of communication between management and 
workers, in handling grievances, and in other supervisory functions. A special
application is provided by the exit interview, held with employees who are
leaving the company. Apart from dealing with the routine details of termina-
tion, such an interview can serve a public relations function by ensuring that
workers leave with a more favorable attitude toward the company. It can also 
yield valuable data on the causes of turnover as found both in worker charac- 
teristics and in conditions within the company. Still another type of personnel 
interview is that used for employee development. Related both to merit rating 
and to training programs, this type of interview is designed to let the employee 
know how well he is doing on the job and to help him improve. It is used 
especially with supervisory and executive personnel in so-called “management 
development programs.” 

Psychological discussions of the interview can be found in a number of 
books. Some cover all types of interviews (8, 41); others concentrate on 
personnel selection interviews (5, 19); and one focuses entirely on the em- 
ployee development interview (51). Some include a survey of published 
research on the interview (5, 8, 41); others are essentially how-to-do-it books 
on interviewing procedures (19, 51). Within the present chapter, we shall be 
concerned only with the interview as a personnel selection tool. Much of 
what is said about interviewing in this context, however, will also apply to 
other uses of interviews. 

Nature of the Selection Interview. Through face-to-face conversation 
with the applicant, the selection interview provides two major sources of 
information, namely, a behavior sample and a reactional biography. As a 
behavior sample, the interview permits the direct observation of certain traits 
such as voice, speech, use of language, nervous mannerisms, and general 
appearance. Since the interview is a dynamic interaction between two persons, 
it may also yield clues to certain complex social traits, such as poise, dominance, 
emotional control, tact, and egocentrism. 

The second source of interview information is the biographical data reported 
by the applicant in response to questioning. Such data should pertain not only 
to job experiences but to the complete life history. Moreover, they should
cover not only what has happened to the individual but also how he reacted 
to it and how he now perceives it. To have been fired because of a fight with 
the supervisor may have a different significance if the individual perceives it 
as an instance of his own emotional immaturity, as plain tough luck, or as 
systematic persecution. Similarly, we want to find out not only what grades he 
received in school but also what subjects he liked and disliked, and why. 

On the interviewer's part, the interview requires skill in data gathering and 
in data interpreting. An interview may lead to wrong decisions because im-
portant data were not elicited or because given data were inadequately or
incorrectly interpreted. An important qualification of the successful inter-
viewer is sensitivity in picking up clues in the subject's behavior or in facts
he reports. Such clues then lead to further probing for other facts that may 
either verify or contradict the original hypothesis. Thus the interviewer en- 
gages in the cycle of hypothesis formation and hypothesis testing discussed in 
Chapter 1. 

The chief potential contributions of the interview to the selection process 
include, first, the assessment of traits for which no satisfactory tests are avail- 
able. It would be foolish to try to gauge intelligence or mechanical aptitude, 
for instance, through an interview, since tests can do the job quicker and 
better. For many social, emotional, and motivational traits, however, an in- 
tensive interview provides the best source of information. A second advantage 
of the interview is that it permits fuller coverage of biographical data than 
is possible through application blanks. Through selective probing, the inter- 
view makes it possible to explore a particular area more intensively as the 
individual’s own responses point the way. This individual adaptability of 
procedure, of course, is impossible with such mass techniques as the applica- 
tion blank. 

A third contribution of the interview concerns the combining of data to 
arrive at final evaluations or decisions. Characteristically, the interviewer 
utilizes a clinical approach in combining both interview data and information 
obtained from tests and other sources (see Chapter 2). Such a procedure 
permits the qualitative interpretation of trait patterns, of the interaction of 
different factors, and of rare circumstances that may be important in individual 
cases but unsuitable to statistical treatment. 

Form of the Interview. The traditional personnel selection interview is 
haphazard, unsystematic, and impressionistic. It is generally brief and un- 
planned. As such it is likely to vary at random from one applicant to another 
and to elicit as much irrelevant as relevant information. At the other extreme 
is the completely standardized interview, which may be little more than an 
orally administered questionnaire. Because of its rigid and artificial nature, 
such a procedure sacrifices the opportunity for personal interaction and flexi- 
bility that an interview can provide, while requiring more staff time than the 
administration of a written questionnaire. 

Between these two extremes is the patterned interview. Also known as 
“guided,” “semistructured,” and “systematic,” this type of interview covers 
certain specified areas, such as work history, education, early home back- 
ground, present domestic and financial condition, and social and recreational 
activities (19, 38, 50). The sequence of areas is usually uniform, and each is 
introduced with a comprehensive standardized question. Within these limits, 
however, the interviewer has considerable latitude. His role is to steer the
conversation into relevant channels and make sure that all areas are ade- 
quately explored. Through follow-up questions, he checks on any points that 
may have been omitted by the applicant or that seem unclear or suspect. 

Sources of Error in Interviewing. Interpretation of interview data is 
subject to the various judgment errors discussed in connection with rating. 
Since evaluations must be made on the basis of a brief contact, halo effect and 
social stereotypes are especially likely to operate. The interviewer's “hunches” 
and "intuitions" often arise from just such judgment errors. Some hunches 
may result from a chance resemblance to a former acquaintance. If a previous 
employee who embezzled company funds happened to have widely spaced eyes 
which gave him a distinctive appearance, the interviewer may respond with a 
vague feeling of distrust when he encounters an applicant with the same 
facial peculiarity. This feeling may be aroused without the interviewer's 
awareness of the basis of his response—hence the mysterious and awesome 
nature of many hunches.

Hunches as such should not be summarily dismissed in the course of the 
interview. Rather, they should be examined. First, we should try to identify
the basis of the hunch. Recognizing the underlying cue will in itself go far 
toward showing whether the hunch was based on a stereotype or chance 
resemblance, or whether it occurred in response to a relevant fact in the 
applicant’s behavior or past record. Second, the hunch should be regarded, 
not as conclusive in itself, but as a hypothesis to be tested by further probing. 

Another type of error that may affect interview findings has been called 
contagious bias. This error refers to the effect that the interviewer's own 
beliefs, expectations, or preconceived notions may have upon the subject's 
responses. The term contagious bias was first used by Rice (60) in an early 
analysis of sociological survey data. Twelve investigators interviewed a total 
of 2,000 homeless men applying for lodging at a municipal lodging house. 
One of the items dealt with the men’s own explanations of their destitution. 
Although the men were assigned at random to each interviewer, certain sub-
groups differed significantly from others in their answers to this question.
Among those interviewed by an ardent prohibitionist, 34 per cent cited liquor 
as the cause of their difficulties, while 42.5 per cent cited industrial conditions. 
In contrast, among those interviewed by an investigator with a socialist bias 
the following results were obtained: 11 per cent, liquor; 60 per cent, industrial 
conditions. 
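
The size of the difference between the two interviewers can be checked with a standard two-proportion z test. The following is only a rough sketch: the percentages are those quoted above, but the number of men seen by each interviewer is not reported here, so the group size is an assumption (an equal split of the 2,000 men among the 12 investigators). The function name is likewise hypothetical.

```python
from math import sqrt

def two_proportion_z(p1, n1, p2, n2):
    """z statistic for the difference between two independent proportions."""
    pooled = (p1 * n1 + p2 * n2) / (n1 + n2)          # pooled proportion
    se = sqrt(pooled * (1 - pooled) * (1 / n1 + 1 / n2))
    return (p1 - p2) / se

# ASSUMPTION: roughly equal case loads; the text gives no group sizes.
n_per_interviewer = 2000 // 12

# Proportions citing each cause, as reported in the text.
prohibitionist = {"liquor": 0.34, "industrial": 0.425}
socialist = {"liquor": 0.11, "industrial": 0.60}

z_liquor = two_proportion_z(prohibitionist["liquor"], n_per_interviewer,
                            socialist["liquor"], n_per_interviewer)
z_industrial = two_proportion_z(prohibitionist["industrial"], n_per_interviewer,
                                socialist["industrial"], n_per_interviewer)

print(f"liquor: z = {z_liquor:.2f}")
print(f"industrial conditions: z = {z_industrial:.2f}")
```

Under the assumed group size, both differences lie well beyond the conventional cutoff of 1.96 in absolute value. With much smaller groups the same percentages would be less convincing, which is why the assumed sizes matter.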

More recent studies, some of which used tape recordings, have repeatedly 
corroborated the effect of interviewer bias and have thrown some light on its 
operation (see 41, ch. 7). Significant differences in the findings of different 
interviewers may occur through incorrect recording of data, misunderstanding 
of ambiguous answers, or actual differences in the subjects’ responses. The 
interviewer may inadvertently inject his ideas into the conversation by his 
wording of questions, by reacting to the respondent's answers in ways that 
differentially reinforce certain types of answers, or by suggesting appropriate 
answers when the respondent hesitates. He may also slant the results by ac- 
cepting some ambiguous answers while following up others by further probing. 
Insofar as the respondent perceives the interviewer's bias, he tends to follow 
the lead. This reaction may be regarded as a special instance of the “social 
desirability” factor that affects both interview replies and personality inventory 
responses (see Chapter 3). In both situations, the respondent tends to give
replies that are "acceptable" to the recipient (79).³

Interviewer Training. Since the interviewer himself is an important ele- 
ment in the interviewing process, the most effective way to improve the inter- 
view is by better selection and training of interviewers. In all too many 
companies, however, personnel interviewers are still chosen because of their 
"interest in people" and clean-cut appearance. Often they are persons lacking 
special qualifications who cannot be fitted into any other job. Actually, inter- 
viewers should be selected like any other technical personnel, in terms of 
relevant abilities, personality traits, educational level, and specialized training. 

Intensive training courses in interviewing itself, usually conducted by an 
outside consulting psychologist, have proved successful in many companies. 
Such courses should include an introductory orientation in interviewing 
principles, followed by skill training. The latter may utilize any convenient 
combination of direct observation of interviews, tape recordings, transcripts, 
role playing in which trainees assume in turn the roles of interviewer and 
applicant, and supervised practice in interviewing genuine applicants (41, 
ch. 9). 

In the course of such training, the interviewer is alerted to the operation of 
various judgment errors and biasing effects. He is also introduced to other 
practices designed to improve the effectiveness of interviewing, such as record- 
ing of facts promptly rather than trusting to memory, citing supporting 
evidence for judgments, and preparing for each interview by studying both 
the job specifications and any available information about the applicant from 
tests, application blank, telephone checkups, and other sources. Such prepara- 
tion ensures that the interviewer focus on relevant qualifications and provides 
hypotheses to be checked in the course of the interview. 

The importance of establishing good rapport and putting the applicant at
ease at the opening of the interview should also be recognized. Most applicants 
are inclined to be tense on first contact with the interviewer, especially if they 
are eager to get the job. In fact, unusual freedom from tension in such a 
situation may indicate either lack of interest in the job or extensive interview 
experience because of frequent job changes. In both instances, the attitude 
would be an unfavorable sign, although it may, of course, have other ex-
planations that would emerge in individual cases.

Training programs often employ interviewing manuals and forms specially 
developed for the particular company. Much of the orientation material, as 
well as areas to be covered, opening questions, and other interviewing aids, 
can be succinctly presented in such fashion. 

Interview Research. In contrast to the extensive use of the interview 
as a selection device, relatively little research has been done on it. Among 
available published studies, some are concerned with factors influencing inter- 
view results, such as characteristics of the interviewer and of the interviewing 
methods employed (41, ch. 7). Much of this research is relevant to the ques- 
tion of interviewer biases, discussed in an earlier section. 

Another group of studies deals with the interviewing process itself. Several 
provide only descriptive accounts of what occurs during the interview. They 
may cover such features as total duration of interviews, proportion of time 


3 Further discussion of interviewer bias will be found in Chapter 10.
taken up with interviewer talking, respondent talking, and silent pauses; mean
length of individual exchanges; and time spent on different kinds of questions 
and answers by interviewer and respondent (14). A suggestive finding of one 
study was that the interviewer tends to talk proportionately more with appli- 
cants he accepts than with those he rejects, more time being spent in empty 
pauses with the rejected cases (2). Of course, such relationships may well vary 
with interviewing methods and other circumstances. 

The total time covered by the interview is also an important consideration. 
A survey of 65 companies showed the most frequent total duration of selection 
interviews to be 15 minutes for unskilled workers, 30 for skilled workers and
for clerical workers, and 45 for technical workers (see 5, p. 17). The entire 
range extended from 1 or 2 minutes to several hours per interview. When last- 
ing only a few minutes, the interview can provide little more than a general 
impression of appearance and a few superficial behavioral characteristics. 
Stereotypes and other judgment errors are also likely to play a major part in 
such brief interviews. For intensive evaluation of high-level personnel, two 
or more hours are generally required. 

A recent well-controlled investigation utilized synthetic protocols of inter-
view information, assembled so that favorable and unfavorable items could be
matched in number and strength and could be presented in predetermined 
orders (11). An initial set to accept or reject the applicant was established by 
providing the interviewer with a preliminary test score. Figure 24 shows the 
effect of successively presented favorable items (in protocol A) and unfavorable 
items (in protocol C) in shifting the interviewers’ ratings away from the initial
position. Each graph is based on mean ratings of 16 interviewers. It can be 
seen that an average of 8.8 favorable items was required in protocol A to shift 
the ratings from “reject” to “accept.” In contrast, an average of only 3.8 un- 
favorable items was sufficient to shift ratings from “accept” to “reject” in 
protocol C. These results suggest that trained interviewers may develop a 
differential sensitivity to unfavorable items, probably because such items are 
less likely to be revealed by interviewees and hence are more significant than 
favorable items when they do emerge. Position of items in the protocol also 
affected interviewer response. The earlier items produced larger shifts than 
did the later items. 
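
The asymmetry can be expressed as a simple ratio. The short sketch below uses only the two means reported above. Treating every item as covering an equal share of the distance between "reject" and "accept" is a simplifying assumption made for illustration, since the study itself found that earlier items produced larger shifts than later ones.

```python
# Mean number of items needed to reverse the initial rating (from the text).
favorable_needed = 8.8     # protocol A: "reject" to "accept"
unfavorable_needed = 3.8   # protocol C: "accept" to "reject"

# ASSUMPTION: each item covers an equal fraction of the reject/accept distance.
pull_favorable = 1 / favorable_needed
pull_unfavorable = 1 / unfavorable_needed

# How many favorable items carry the weight of one unfavorable item, on average.
ratio = favorable_needed / unfavorable_needed
print(f"one unfavorable item weighs about as much as {ratio:.1f} favorable items")
```

On this rough reckoning, a single unfavorable item carried more than twice the weight of a favorable one.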

A third type of interview research centers on the basic questions of reliability 
and validity. Reliability of interviews includes both intrarater and interrater 
consistency. Intrarater reliability may be studied by having the same inter-
viewer either reinterview the same persons at different times or independently
reappraise the applicants from sound recordings of the identical interviews 
after a time interval. When an over-all interview rating is computed from the 
ratings on individual items, odd-even correlations can also be found for each 
interview. Interrater consistency can be measured either by comparing the 
results of interviews of the same persons by different interviewers or by having 
different interviewers evaluate the applicants from sound recordings of single 
interviews. All of these procedures provide somewhat different information. 
Differences in the applicant's own behavior at different times or with different 
interviewers, for example, are eliminated as a source of variation when record-
ings of single interviews are used.
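
Two of these indices can be sketched in a few lines of Python. All the ratings below are invented for illustration, and the function names are hypothetical. The odd-even index is computed here in one common way, by correlating each applicant's total on the odd-numbered items with his total on the even-numbered items. Interrater consistency is taken as the correlation between the over-all ratings that two interviewers assign to the same applicants.

```python
from math import sqrt

def pearson(x, y):
    """Pearson product-moment correlation between two equal-length lists."""
    n = len(x)
    mean_x, mean_y = sum(x) / n, sum(y) / n
    sxy = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    sxx = sum((a - mean_x) ** 2 for a in x)
    syy = sum((b - mean_y) ** 2 for b in y)
    return sxy / sqrt(sxx * syy)

def odd_even_reliability(item_ratings):
    """Correlate odd-item totals with even-item totals across interviews.

    item_ratings holds one list of item ratings per interviewed applicant.
    """
    odd_totals = [sum(r[0::2]) for r in item_ratings]    # items 1, 3, 5, ...
    even_totals = [sum(r[1::2]) for r in item_ratings]   # items 2, 4, 6, ...
    return pearson(odd_totals, even_totals)

# HYPOTHETICAL data: five applicants rated on six items by one interviewer.
item_ratings = [
    [4, 5, 4, 4, 5, 4],
    [2, 1, 2, 3, 2, 2],
    [3, 3, 4, 3, 3, 4],
    [5, 4, 5, 5, 4, 5],
    [1, 2, 1, 2, 2, 1],
]

# HYPOTHETICAL over-all ratings of the same five applicants by two interviewers.
interviewer_a = [8, 5, 6, 9, 3]
interviewer_b = [7, 5, 7, 9, 4]

r_odd_even = odd_even_reliability(item_ratings)
r_interrater = pearson(interviewer_a, interviewer_b)
print(f"odd-even correlation: {r_odd_even:.2f}")
print(f"interrater correlation: {r_interrater:.2f}")
```

The same `pearson` function serves for the intrarater case if the two lists hold one interviewer's ratings of the same applicants on two occasions.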

Validity is sometimes determined globally, by checking final interview predic-
tion against a composite criterion of job success. Other investigators analyze


[Figure 24 plot: ratings (vertical axis) against items (horizontal axis), with one curve for favorable information and one for unfavorable information, each running from the "Reject" (A) or "Accept" (C) start line toward the "Accept" (A) or "Reject" (C) line.]


Fig. 24 Shift in interviewer judgment in response to favorable and 
unfavorable items. Units noted on each curve represent total item weight 
of the 10 items in each protocol. (From Bolster and Springbett, 11, p. 99.) 


the validity of individual trait evaluations in terms of subsequent manifesta- 
tions of the same traits. It must be recognized that any general statements 
about reliability or validity of the interview are bound to be misleading. 
Interviews vary widely in both reliability and validity depending upon many 
factors, such as the form of the interview (traditional, standardized, or pat-
terned), its duration and thoroughness, and the qualifications of the inter-
viewer. Since the interviewer, like the clinician, is an integral part of the
process, research designs on the reliability and validity of interviewing should 
take into account individual differences among interviewers (32).

In the light of the above considerations, it is not surprising to find marked 
variations in the reliabilities and validities of interviews reported in published 
surveys (30, pp. 162-168; 50; 77). In general, early studies of “experienced”
but untrained interviewers (usually sales managers) using the traditional, 
unplanned type of interview reveal very poor interrater agreement. Thus, in 
a group of 57 applicants for a sales position, the same applicant might be 
rated first by one interviewer and fifty-seventh by another! More recent 
studies of patterned interviews conducted by trained interviewers yield much 
more promising evidence of reliability and validity (33, 38, 50, 82). In one 
such study, for example, significant correlations ranging from .43 to .68 were 
found between interview ratings of predicted job success and subsequent 
criterion measures (50). These validity coefficients were obtained on groups
comprising from 84 to 587 employees in three companies; the criteria included 
foremen's ratings and turnover data. 


SUMMARY 


Application blanks, references, and personal interviews utilize combinations 
of biographical and interpersonal data. What the individual has done in the 
past is an effective predictor of future performance. Interpersonal evaluations 
provide appropriate indices for many social traits. Rating procedures, utilized 
for many personnel purposes, include order-of-merit comparisons, rating scales, 
scaled checklists, forced-choice technique, nominating technique, critical inci- 
dents, and field review method. Ratings are susceptible to halo effect, leniency 
and central tendency errors, and social stereotypes. They are influenced by 
amount and kind of acquaintance and by many rater characteristics. 

Letters of application and traditional application blanks are being replaced 
by weighted application blanks, found to be valid for a wide variety of jobs. 
Letters of recommendation are often too general and insufficiently discrimina- 
tive. Employment recommendation questionnaires are a potentially useful 
selection device but have been inadequately developed. Telephone checkups 
and field investigations provide fuller and more dependable information. 

Interviews are used not only for personnel selection but also for a number 
of supervisory and employee-relations functions. The selection interview 
provides a behavior sample and a reactional biography. With regard to form, 
interviews include the traditional, standardized, and patterned. Common in- 
terviewing errors, besides those shared with other rating procedures, include 
hunches based on irrelevant cues and contagious bias. Patterned interviews 
conducted by trained interviewers have yielded promising evidence of re- 
liability and validity. 
Personnel Development
and Training


Selection and classification do not provide industry with all the skills it re- 
quires. Many of these skills must be learned by employees after they are hired. 
The emergence of new industries and the rapid changes in industrial opera- 
tions characteristic of modern society create especially heavy demands for 
training. When there is a manpower shortage in any field, moreover, improved 
and augmented training programs must make up for the necessary lowering 
of selection standards. The relative weight placed on selection and training
also varies with the nature of the job. Typists, for instance, have generally
learned to type before they are employed, while assembly-line operators are 
usually trained on the job. 

That psychology has much to contribute to training programs is apparent 
when we realize that the psychology of training is the applied psychology of 
learning. Learning theory is at the very core of the science of psychology and 
has been especially productive of well-designed research. Yet psychologists 
have not been so widely involved in industrial training as they have in per- 
sonnel selection. The missing link was a dearth of applied research in this 
area. Such applied research was needed to bridge the gap between the wealth 
of available basic research on learning and the vast number of practical situa-
tions where training was going on. It is only since World War II that
psychologists have come to play a major part in training programs. The signifi- 
cant contributions of psychologists to military training during the war un- 
doubtedly did much to demonstrate their potential contribution to training 
problems in general (5, 81). Although the necessary applied research is now 
being conducted in industrial contexts as well, much of it is still done under 
military sponsorship. 


THE SCOPE OF INDUSTRIAL TRAINING 


Traditionally, training is differentiated from education in terms of breadth 
of goals. The primary objective of training is the acquisition of specific skills 
and information, as in learning to drive a truck, operate a lathe, or fill out a 
sales slip. Education is concerned with the development of more widely ap- 
plicable skills, knowledge, and attitudes, as in reading, solving arithmetic 
problems, understanding modern society, or enjoying music. Industrial train- 
ing, however, is being increasingly directed toward broader, long-range goals 
and is thus taking on more and more of the earmarks of education. This trend 
is exemplified by the increasing use of the term “development” in connection 
with industrial training programs. The term "training" itself is coming to be
used in a more comprehensive sense, to include development. Particular em- 
phasis is now placed on the role of training programs in the development of 
attitudes, especially with regard to interpersonal relations. However broadened 
in scope, though, industrial training is necessarily job-oriented. While recog- 
nizing that the development of the "whole person" may in the long run be the 
best way to train more effective workers, industry is still primarily concerned 
with improving job performance. 

Who Is to Be Trained? Training is not limited to the new employee. It is 
a continuing process from which old as well as new workers can benefit. Apart
from the orientation and skill training provided for the incoming worker, 
training is a means of preparing promising workers for promotion to a higher 
job level. Such training is particularly important for promotion to a supervisory 
post, in which the employee must assume functions not covered by his present 
job. Training may take the form of refresher courses for experienced workers,
to ensure the continuance of effective work methods or to update information 
and procedures. 

A comprehensive and continuing training program is one way to maintain 
and improve operating procedures throughout the company. An important 
function of the training department is to make periodic surveys of training 
needs through analyses of production, accident, turnover, and other records; 
interviews and conferences with supervisors; questionnaire surveys of em- 
ployees; and other appropriate techniques (see 20, ch. 3; 45, chs. 2-4; 47; 75, 
pp. 252-261). Checklists have been developed to help training specialists to 
carry out such surveys. These procedures serve to locate departments or areas 
of operation in which training is particularly desirable. Similarly, the examina-
tion of merit ratings and other individual evaluation techniques helps to
identify workers most in need of training. With regard to job level, the area 
of training is coextensive with the full range of jobs, from unskilled labor to 
top management. Widespread adoption of management development programs 
is, in fact, one of the chief characteristics of modern industrial training. 

Kinds of Training. The number and variety of training programs that 
may be found within a single company are very great. In duration, the train- 
ing may range from a one-hour orientation lecture to a four-year course in a 
company school. In large companies, each of the major functions (i.e., market-
ing, finance, manufacturing, engineering, and employee relations) often has
its own training program. And, within each function, separate programs are 
available for many different activities. In employee relations, for example, 
there might be training programs on merit rating, interviewing, safety, wage 
and salary administration, and labor relations.

Most training programs for manual and clerical jobs are concerned chiefly 
with job skills. Training programs in technical and professional areas deal with 
many specialized techniques, from drafting to job evaluation. Cutting across 
training in the different company functions are management development 
programs for supervisory and executive personnel. Human-relations training 
plays a particularly important part in such management development programs, 
although it also enters into more specialized programs, as in the training of 
sales and employee-relations personnel. Because of the growing interest in 
management development, this type of training will be discussed more fully 
in a separate section of the chapter. 
Another company-wide type of training is orientation. Also known as "induc-
tion" or “indoctrination,” orientation training is designed to acquaint new em-
ployees with company practices, policies, and regulations. It should also
provide information about the company and its products or services in such a 
way as to increase the employee’s identification with the company and to 
foster desirable attitudes toward the job. Although traditionally associated 
with incoming employees, orientation training is now regarded as a continuing 
process. Keeping all employees informed about the company through lectures, 
conferences, handbooks, employee magazines, and more informal personal 
contacts is the comprehensive aim of modern orientation programs. There is 
also an increasing emphasis on the development of attitudes, as contrasted with 
the imparting of routine facts. 

Some companies also provide general education in practically any field in 
which there is enough employee interest. As a service to employees, courses of 
a purely cultural or recreational nature—from sociology or ancient history to 
choral singing or contract bridge—may be offered in company schools. Certain 
employees may also be sent under company auspices to colleges or universities 
to complete educational requirements for promotion to higher-level jobs or 
possibly just to obtain the broad perspectives of a liberal education. 

Training Media. In view of the wide diversity of trainees, objectives, and 
content, it is to be expected that industrial training programs utilize many 
different media. Although two or more media are frequently combined and 
the distinctions among media are not at all sharply drawn, the most common 
training media may be considered under seven headings: (1) lectures, 
(2) training manuals and other printed materials, (3) films and television, 
(4) training devices, (5) job performance, (6) group-discussion and social 
interaction methods, and (7) problem-solving interviews. 

A lecture consists essentially of a single oral presentation of verbal material. 
Although usually given in person, it can also be presented through recordings, 
sound films, or television. In itself, the lecture does not provide for active 
participation by the learner, nor for repeated practice. Frequently it is used 
for preliminary explanation of procedure, which is then followed up with 
demonstration, practice, discussion, or other techniques. The lecture itself can 
also be supplemented with such training aids as charts, slides, or models. 
Lectures are often employed in orientation training to provide simple factual 
information such as plant organization or safety rules. Under certain condi- 
tions, lectures can be effective in modifying attitudes (31). 

Training manuals, handbooks, and other printed materials also present in- 
formation in verbal terms and—like the lecture—may include pictures and 
diagrams. Such printed materials are likewise suitable for orientation purposes. 
Unlike the lecture, however, they permit repeated exposure, since the trainee 
may consult the printed instructions as often as necessary. For this reason, 
this instructional medium is appropriate for the learning of long sequences or 
complicated procedures which could not be acquired in a single oral presenta-
tion. Training manuals are often combined with lectures and with practice in
actual job performance. A training manual should be much more than a list 
of steps to be followed. Much can be done to make such a manual interesting 
to read and easy to understand. Learning principles can be utilized in its 
preparation, as in the development of any training procedure. 

Films, as well as closed-circuit television, may be used as a substitute for
the formal lecture and demonstration. When so employed, they share the
advantages and limitations of the personally delivered lecture. They do, never-
theless, provide uniformity of presentation in a large-scale program. And they 
permit maximal utilization of the best qualified instructors. Short films are 
frequently combined with ordinary lectures, as a training aid. 

Certain intrinsic features of the motion-picture medium make it particularly 
suitable for training purposes. Since they permit the controlled presentation of
visual stimuli in motion, films can be used in discrimination training, as in 
learning to recognize aircraft or read a radar scope. They are well adapted to 
learning a sequence of movements. The fact that the operation can be slowed
down without altering its nature gives the film an added advantage over a live 
demonstration, where this is often impossible. The editing of a film by 
cutting and splicing may also increase its training effectiveness through focus- 
ing attention on important details, eliminating irrelevant material, altering 
sequences for increased comprehension, and similar modifications. 

Films can present a situation realistically, while avoiding the hazards of 
direct experience. For instance, a film can show the consequences of incorrect 
procedures, as in an automobile or airplane crash. Still another intrinsic ad- 
vantage of films arises from the use of the camera in the “subjective” point of 
view, i.e., showing objects and movements as seen by the participant rather
than by an external observer. This difference can be readily demonstrated if 
we try to teach someone how to tie knots, as illustrated in Figure 25. The 
picture on the left shows how the observer's own hands would look in tying 
the knot, while the picture on the right shows the instructor facing the ob- 
server. In experiments with naval trainees (65), the proportion who could 
correctly tie three kinds of knots after a single presentation of different train-
ing films was compared. The results showed a significant difference in favor
of the subjective point of view. Films portraying continuous motion also led
to significantly better learning than those showing static shots of successive
stages in the knot-tying process.

Fig. 25 Single frames from two training films on knot tying, illustrating
the "subjective" (left) and "objective" (right) point of view of camera.
(See Roshal, 65; photographs by courtesy of S. M. Roshal.)

In a still different connection, films have proved effective in the dramatic 
presentation of facts or events to aid understanding of abstract ideas and to 
modify attitudes. During World War II, for example, films were employed 
in the army’s orientation program on “Why We Fight” (31). 

In contrast to the previously discussed, "passive" training procedures, train- 
ing devices provide learner participation and repeated practice in actual job 
skills. They may be employed as a substitute for real equipment when use of 
the latter by an inexperienced operator might entail risk to materials or per- 
sonnel. Training devices may also isolate or recombine different job func- 
tions, so that the learner may spend more time on the more difficult or critical 
part activities. On the other hand, when operating equipment is relatively 
inexpensive and readily available and can be used without risk, it would be 
uneconomical to develop special training devices. There would be no justi- 
fication, for instance, in preparing a training device for learning to type. 

Although training devices call for the manipulation of equipment, they 
are by no means limited to the teaching of motor skills. Many deal with the 
understanding of operating principles and their application to problem solv- 
ing. Sensory discrimination, tracking skills, and complex sequences of motor 
responses may likewise be required. Because of the importance and growing 
utilization of training devices, they will be discussed in a separate section of 
this chapter, devoted to automated training. In that section will also be in- 
cluded the so-called "teaching machines" which provide automated, pro- 
grammed instruction with verbally presented materials. 

Even more direct than training devices is actual job performance. By this 
method, the new employee learns while actually working under the guidance 
of an experienced worker, a supervisor, or an instructor. On-the-job training 
is an age-old method for transmitting job skills. All too often it provides little 
more than unguided experience, including the learner's own trial and error, 
his undirected observation of experienced workers, and some casual, unsys- 
tematic instruction. Supervisors and experienced workers often lack the time, 
motivation, and teaching skills to provide the necessary instruction. It is well 
known that a successful performer is often a poor teacher. Traditional on-the- 
job training has been described as “sitting by Nellie,” since it involves little 
more than assigning the novice a place near an experienced worker (80). 

On-the-job training can be improved by training supervisors or other se- 
lected employees in effective instructional methods and by ensuring that they 
have enough time for training new workers. The substitution of systematic 
on-the-job training for traditional haphazard procedures has regularly led to 
marked improvements in speed of learning and quality of performance. There
is considerable research demonstrating that “experience” is not the best teacher, 
when it is in the form of unguided exposure to the task. Individuals do not 
work out the most effective procedures when left to their own devices. The 
conspicuous rises in production often found when experienced workers are 
put through a systematic training program demonstrate the same point. 

The vestibule school represents one way of increasing the effectiveness of 
job-performance training. Such a school may use regular job equipment, as 
well as training devices and other instructional procedures. It provides training
in a separate room, free from the distractions and pressures of the working 
environment. Instruction is conducted by a training-oriented instructor rather 
than by a production-oriented supervisor. Depending upon the skills to be 
taught, vestibule training may last from a few days to several months. 

Systematic job-performance training is also provided in apprenticeship pro- 
grams. Developed for the preparation of skilled craftsmen, such programs 
combine classroom instruction with directed shop practice. Detailed specifica- 
tions have been worked out for apprentice programs in different crafts, such 
as machinist or toolmaker, which may require as much as four years. Unlike 
the vestibule school, whose goal is the rapid teaching of one or more semi- 
skilled operations, the apprenticeship provides training for a total skilled job 
covering many different activities. 

Several related training procedures involving group discussion and social 
interaction have come into prominence with increasing interest in the develop- 
ment of human-relations skills. Like training devices and job-performance 
methods, they are characterized by active learner participation. The training 
activities involve many of the interpersonal skills required in supervisory and 
executive jobs, such as solving problems by group discussion, effective con- 
ference leadership, and face-to-face contacts with individuals in handling 
realistic supervisory problems. Because they are used chiefly in management 
development, these training methods will be discussed in the later section on 
that topic. 

Still another training medium is the problem-solving interview. Closely 
related to both merit rating and employee counseling, such interviews are 
held periodically by supervisors or training officers as a means of improving 
the job effectiveness of individual workers. Although applicable to all types 
of employees, this technique is most widely employed in management develop- 
ment and will be discussed further in that connection. 


DEVELOPING A TRAINING PROGRAM 


As in many other personnel functions, the principal role of the psychologist 
in industrial training is that of consultant. His major contribution is in the 
development, installation, and evaluation of training programs. He may also 
carry out applied research on training problems. And he may conduct special- 
ized training courses in such functions as interviewing, merit rating, or the 
human-relations aspects of supervision. Frequently, too, he “trains the trainers,” 
who then carry out the bulk of the company’s training in all areas. 

Like the development of personnel selection programs described in Chapter 
2, the development of training programs illustrates the fundamental scientific 
procedures of defining the problem, formulating hypotheses, and testing hy- 
potheses under controlled conditions. What is actually done in carrying out 
such a program can be summarized under three major headings: (1) task 
analysis, (2) development of training procedures, and (3) evaluation of train- 
ing program. 

Task Analysis. The purpose of the task analysis is to determine the 
content of the training program. This step tells us what must be learned. 
Such a task analysis is similar to the job analysis conducted for selection pro-
grams, in that both emphasize the skills needed to perform the job. Both begin
with a full listing of operations to be performed and indicate which are the 
critical part activities (see Chapter 2). In task analysis, however, the opera- 
tions to be performed are generally broken down into finer units. Through 
task analysis, it should be possible to determine whether some activities re- 
quire little training because they are already in the learner's behavior reper- 
tory, which activities require the most intensive training, what is the most 
effective sequence for learning different tasks, and similar points useful in 
planning the training program. 

Many of the procedures followed in task analysis are similar to those de- 
scribed in Chapter 2 for use in job analysis (see 20, ch. 3; 23, ch. 7; 45, ch. 
3; 75, ch. 9). Observation of the job by the analyst and consultation with 
workers and supervisors are common procedures. Previously prepared check- 
lists and questionnaires are often used to facilitate task description. These 
techniques may be supplemented by examination of operating manuals and 
of available records and reports that might indicate causes of job failure and 
special sources of difficulty. The critical-incident technique has also been 
used in this connection (see, e.g., 25). 

Factor analysis can help in identifying the skills that are of primary im- 
portance in different stages of learning a task. The value of this approach was 
demonstrated in an investigation of a complex sensorimotor task similar to 
that required in flying an interceptor aircraft (61). Factorial analyses of this 
training task had shown that performance in early practice sessions depended 
largely on spatial orientation, while performance in later sessions depended 
increasingly on multilimb coordination. On this basis, an experimental train- 
ing program was devised in which the oral instructions to trainees focused 
on different aspects of the operation at different learning stages. Performance 
records showed that the group taught by this method learned significantly 
better than a group taught by conventional methods. This investigation illus-
trates the sort of contribution that task analysis can make to training proce-
dures themselves.

Development of Training Procedures. Once we know what must be 
learned, the next step is to determine how it should be taught. This question 
concerns not only the choice of training media but also decisions regarding 
many other aspects of training procedure, such as how to motivate the learner, 
how to distribute learning over time, how to inform the learner regarding his 
performance and progress, and how to ensure maximal transfer of training 
to job performance. The answers to all these questions depend in part on the 
content of learning. They also depend upon learner characteristics. Training 
research should be so designed as to permit an analysis of the interaction of 
method with both content and subject variables (11, 16). The relative effec-
tiveness of different training methods may vary with the age, education, job
experience, and intellectual level of the persons to be trained. It may also 
vary more specifically with the individual's aptitude pattern. Thus some per-
sons may learn better through techniques requiring primarily spatial visualiza-
tion, others through techniques relying chiefly on verbal comprehension. The
qualifications of the available instructors who will conduct the training may 
also be a determining factor in choosing procedures.

In his development of training procedures, the psychologist is also guided
by certain learning principles. Derived from basic learning theory, these 
principles are supported by applied research conducted in industry and the 
armed services (see 23, ch. 7; 44; 45, chs. 5 and 6; 81). It is well estab- 
lished, for example, that motivation helps learning. Such motivation may be in 
the form of higher pay, security, promotion, other forms of recognition or 
status, or even the satisfaction of succeeding on the job. To be effective, the 
anticipated reward must be one that is significant for the particular learner. 
In addition, the training itself may provide intrinsic motivation through defi- 
nite and realistic goals. If the learning is broken down into units and if 
performance standards are set neither too high nor too low, the individual's 
feeling of accomplishment as he achieves each goal motivates him to continue 
learning. Unrealistic goals, on the other hand, may produce frustration and 
resentment and retard learning. 

Learning is also hindered by stress, which may result from anxiety, em-
barrassment, distraction, or confusion. Stress is especially detrimental in the
learning of complex functions requiring discrimination, problem solving, and 
flexibility of approach (35). Freedom from stress is one of the advantages of 
the vestibule school as compared with training on the job. After essential 
skills have been mastered, the worker is better able to cope with the more 
stressful working environment. 

The familiar adage, "Practice makes perfect," is well grounded in learning 
theory and has several implications for the management of training. Active 
participation by the learner is a prerequisite for effective learning. Elaborate 
visual aids and impressive demonstrations may be of little use unless they 
lead to some action on the part of the learner. Moreover, the task should be 
practiced as it will eventually be performed. Verbal description of a sequence 
of movements, for instance, is no substitute for carrying them out. At the 
same time, the learner should have sufficient guidance, especially in the early 
stages of training, to prevent the practicing of errors. Practicing the wrong 
response makes it necessary to unlearn it later on. It is for this reason that 
correctness and accuracy are generally emphasized in the early stages of train- 
ing, speed in the later stages. This approach is effective except when reduc- 
tion of speed alters the nature of the responses. In that case, the learner would 
not be practicing the responses he ultimately needs to make. 

One of the best-established learning principles pertains to feedback, or 
knowledge of results (58, 81). Letting the learner know how well he is do- 
ing serves the dual function of motivating him and of identifying and thus 
differentially reinforcing the correct responses. Such reinforcement is more 
effective if it is immediate and specific. With increasing delay, the effect 
of the reinforcement drops off rapidly. Precise information about nature and 
extent of errors produces faster improvement than being simply told whether 
a response is right or wrong. 

The relative merits of part versus whole learning have been repeatedly 
investigated. Performing the whole task during each trial has the advantage 
of affording practice in the activity that is ultimately required. In many situa- 
tions, whole learning also makes the task more meaningful and permits the 
proper establishment of relations among parts. In many industrial tasks, how- 
ever, some operations or units are more difficult to learn than others. Hence 
uniform practice by the whole method would lead to overlearning of some
parts and underlearning of others. Under such conditions, some combination 
of part and whole methods usually proves best (40, 71). 

To be of practical use, industrial training must transfer to job performance. 
A trainee may become highly proficient in the use of a training device; a 
supervisor may exhibit sophisticated human-relations attitudes and skills in the 
course of group-discussion training; but if these newly learned responses are 
not manifested in improved job performance, the training has failed. Learning 
research suggests that transfer of training is more likely to occur between 
situations that have many identical elements in common. This again high- 
lights the desirability of making the training tasks as similar as possible to 
the job task. Another way of increasing transfer is to provide a variety of tasks
in the learning situation. Such variation helps the individual to generalize
his learning to new situations. Variation in the irrelevant parts of the stimulus 
situation forces the learner to respond to its critical aspects. Finally, giving the 
learner an understanding of principles facilitates transfer. But merely stating 
a principle is not enough. A more effective way is to have the learner work 
out the principle for himself by observing its application in a number of dif- 
ferent situations. The relative merits of these different techniques for achiev- 
ing transfer depend to some extent on the nature of the task (see, e.g., 69). 

The above discussion is not intended as an exhaustive survey of the ways 
in which available learning research may guide the development of training 
procedures. It merely illustrates how the psychology of learning can be applied
to industrial training. The list of principles could readily be expanded; and 
many more detailed findings could be cited regarding the application of any 
of the given principles (21, 44, 81). Moreover, the usefulness of any of these 
principles in a specific training situation cannot be assumed but must be 
determined through task analysis and through applied research within the 
particular context. Practicing the final task, for example, is not always the 
most effective learning technique. Thus in learning a procedural sequence, 
such as is required for routine maintenance operations, the individual activi- 
ties of turning dials, throwing switches, and the like are already in the learn- 
er’s repertory. What must be learned is the identification of parts to be 
manipulated and the sequence of acts to be performed (21). Other operations, 
such as "troubleshooting," or diagnosis of malfunction, require the learning 
of rules and concepts, rather than the practice of a given set of tasks. 

Evaluation of Training Program. Like selection programs, training pro-
grams should be objectively and systematically evaluated. We cannot assume
that training improves job performance. Nor can we rely on expressions of 
opinion, as is done in many companies (47). Workers and supervisors may 
believe they have benefited from training without manifesting any genuine 
improvement. To evaluate a training program, we need to compare the job 
performance of the same group before and after training, or the perform-
ance of a trained group with that of a control group, or the performance of
two or more groups trained by different methods. Like the content of training, 
the criteria for evaluating its effectiveness must be derived from a job analysis. 
Whether or not training was successful can be determined only in the light 
of the specific objectives of the training program. 
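
The first two comparisons named above can be sketched in a few lines of
Python. This is only an illustration under stated assumptions: the hourly
output figures are invented, the function names are hypothetical, and
Welch's t statistic is one common choice for comparing two independent
groups, not a procedure prescribed by the studies cited in this chapter.

```python
from math import sqrt
from statistics import mean, stdev


def before_after_gain(before, after):
    """Mean gain for the same workers measured before and after training."""
    if len(before) != len(after):
        raise ValueError("before and after must list the same workers")
    return mean(a - b for b, a in zip(before, after))


def group_difference(trained, control):
    """Difference in group means and Welch's t statistic for two groups."""
    diff = mean(trained) - mean(control)
    se = sqrt(stdev(trained) ** 2 / len(trained)
              + stdev(control) ** 2 / len(control))
    return diff, diff / se


# Hypothetical hourly output figures, invented purely for illustration.
before = [40, 42, 38, 45, 41]    # trainees before training
after = [46, 47, 44, 49, 45]     # the same trainees after training
control = [41, 43, 39, 44, 42]   # untrained workers, measured at the same time

gain = before_after_gain(before, after)
diff, t = group_difference(after, control)
print(f"Mean before-after gain: {gain:.1f} units per hour")
print(f"Trained minus control: {diff:.1f} units per hour (Welch t = {t:.2f})")
```

A before-and-after gain taken alone cannot separate the effect of training
from that of added job experience, which is why the comparison with a
control group is computed as well.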

Several sources of criterion data are available for evaluating the effective-
ness of training (20, ch. 8; 37; 75, ch. 9). For many jobs, a major criterion
is increase in productivity, as indicated by average hourly output, time re-
quired per unit task, total amount of sales, and other indices of amount or
quality of work. Breakage and use of consumable supplies may also be an 
important criterion in some types of jobs. 

A combination of these criteria was employed by Lindahl (40) in evaluating 
a specially developed training program. The job was that of cutting small 
tungsten disks with a machine whose successful operation depends upon the 
speed, form, rhythm, and pressure pattern of the operator's hand and foot 
action. Through a careful task analysis with specially devised recording equip- 
ment, Lindahl was able to get a graphic record of the proper pattern of foot 
movement. This pattern, reproduced in Figure 26, was then used as a stand- 
ard in training operators. The recording instrument, which provided con- 
tinuous knowledge of results, served as a training device. The progress made 
by one trainee is illustrated in Figure 27. It will be seen that after 239 hours 
the trainee's foot action closely approximated the standard pattern. 

Weekly production records showed the effectiveness of this systematic train- 
ing as compared with unguided job experience. After 11 weeks of training, 
the new employees reached the production level of untrained employees who 
had been on the job 5 months. Training also brought about considerable sav- 
ing in costs by reducing the breakage of abrasive cutting wheels. As can be 
seen in Figure 28, after only 5 weeks of training, the trained group was 
breaking fewer wheels than did untrained employees with 5 months of job 
experience; by the tenth week, the trainees excelled the workers with 9 
months of job experience. 

Fig. 27 Disk-cutting foot-action patterns of a trainee after 9, 45, 141,
and 239 hours of supervised operation. (From Lindahl, 40, p. 425.)

Other criteria of training effectiveness include number of accidents, absen-
teeism, and labor turnover. In a follow-up study reported by Lawshe (37), 70
operators who had attended a vestibule school were compared with 70 trained
“on the floor.” During the first 30 days on the job, the vestibule-school group
had 19 per cent fewer accidents, 51 per cent fewer absences, and 55 per cent
less turnover than the control group.

Certain types of criterion data may be obtained in the course of training. 
Thus different training procedures may be compared with regard to rate of 
learning or percentage of operators who meet specified performance standards 
within the training period. During World War II, Keller (33) developed an 
improved method for teaching Morse code to radio operators. The new method 
provided immediate knowledge of results and utilized “whole learning” of the 
complete set of letters and digits. The superiority of this training method 
over the previously used army method was indicated by two training measures. 
The average amount of training time required to reach the standard of five 
words per minute was reduced from between 35 and 40 hours to 27 hours. 
Similarly, only 3.4 per cent of the trainees taught by the new method failed 
to attain the required speed, in contrast to 15 per cent of those taught by the 
old method (see 81, p. 1269).
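
The size of these two improvements can be worked out directly from the
figures quoted above. The short Python sketch below does only that
arithmetic; the function and variable names are hypothetical, and no data
beyond the numbers given in the text are assumed.

```python
def percent_reduction(old, new):
    """Percentage by which `new` falls below `old`."""
    return 100.0 * (old - new) / old


# Figures quoted in the text for the old and new Morse code methods.
old_hours_low, old_hours_high, new_hours = 35, 40, 27
old_fail_pct, new_fail_pct = 15.0, 3.4

saving_low = percent_reduction(old_hours_low, new_hours)    # about 22.9
saving_high = percent_reduction(old_hours_high, new_hours)  # 32.5
failure_ratio = old_fail_pct / new_fail_pct                 # about 4.4

print(f"Training time saved: {saving_low:.1f} to {saving_high:.1f} per cent")
print(f"Old-method failure rate is {failure_ratio:.1f} times the new one")
```

On these figures the new method saved roughly a quarter to a third of the
training time, and the proportion of failures under the old method was more
than four times that under the new.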

Training criteria were also employed in evaluating a program developed in 
England for training shoe machinists (71). The training was conducted in 
separate schools by trained instructors. Effective work methods were first
developed through a careful task analysis. Target times and quality standards 
were also empirically established for different training stages. The training 
was conducted by a progressive-part method, which involves the independent 
learning of different units and their subsequent practice in combination with 
previously learned units. Films, filmstrips, and training devices were employed. 
Through these various improvements, the required training period was re- 
duced from one year to between six and eight weeks. Moreover, the new 
trainees earned more when first employed on piece rate than did experienced 
workers trained by unsystematic on-the-job procedures. 

Objective written examinations administered upon the completion of train- 
ing are a further source of criterion data. Such measures are appropriate 
when the objective of training is to impart technical information or other 
factual content such as details of company policy, union contracts, and labor 
regulations (see, e.g., 64). Insofar as such factual knowledge may be one pre- 
requisite for success as a foreman, for example, performance on the examina- 
tion is an index of training effectiveness. 

Other aspects of job performance, however, may bear little or no relation 
to performance in training or to the acquisition of factual knowledge. In the 
application of human-relations skills, for instance, it is particularly important 
to obtain follow-up data on actual job performance. The effectiveness of super- 
visory training in general may be evaluated through the job performance of 
the men who work under the supervisor. Since the foreman "works through 
people," the ultimate success of his training should be manifested in improved
job performance by his subordinates. Any of the previously mentioned indices
of job performance could be utilized for this purpose.

Fig. 28 Decrease in breakage rate of trainees, in comparison with average
breakage rates of present employees after 2, 5, and 9 months of job
experience. (From Lindahl, 40, p. 430.)

An approximation of such ultimate criterion data of supervisory success may 
be obtained through questionnaires filled out by the men in each supervisor's 
work group. This procedure was followed in a well-designed longitudinal in- 
vestigation conducted at the International Harvester Company by Harris and 
Fleishman (30). A random sample of workers in each foreman's group com- 
pleted a Supervisory Behavior Description on two occasions, 11 months apart. 
During this period, 39 of the foremen had attended a two-week course in 
leadership training conducted at a central company school; the remaining 50,
who had received no such training, constituted the control group. On the 
Supervisory Behavior Description, each worker indicated how often his fore-
man did what the item described, such as planning each day's activities in
detail, insisting that everything be done his way, or helping his men with 
their personal problems. These questionnaires were scored for two independent 
leadership dimensions, which had been identified through factor analyses in 
earlier research. One of these dimensions was described as "Consideration," 
representing a friendly, warm, and considerate supervisory relation, as con- 
trasted with an impersonal and authoritarian relation. The other dimension, 
designated as “Initiating Structure,” concerned the extent to which the super-
visor actively plans and directs group activities oriented toward goal attain-
ment.

A comparison of the mean scores obtained by the foremen in both leadership 
dimensions on the two occasions revealed no significant change in supervisory 
behavior in either the trained or the control group. In an earlier study con-
ducted in the same company, on the other hand, significant differences had
been found in the foremen's supervisory attitudes as reported in a question- 
naire filled out by the foremen themselves at the beginning and end of train- 
ing (17). However, a comparison of matched groups of foremen who had 
been back on the job for varying periods showed no significant mean differ- 
ences between trained and untrained groups in either attitudes or behavior. 
These findings suggest that what was learned in training did not transfer to 
the plant situation. 

Through an ingenious set of additional questionnaires designed to elicit the 
attitudes and expectations of the foreman’s own supervisor, as well as the 
foreman's description of his supervisor's behavior and his perception of what 
his supervisor expects, Fleishman was able to demonstrate that the “leadership 
climate” in which the foreman operates is a major determinant of his own 
attitudes and behavior. Moreover, the effect of training varied for individual 
foremen, probably as a function of the foreman’s pretraining characteristics 
and of the leadership climate under which he worked. Although training did 
not produce significant group differences in job behavior, it led to greater 
shifts in the attitudes and behavior of individual foremen than were found 
in the untrained control group. It thus appears that training may have made 
lasting changes in attitudes and job behavior but that the interaction of train- 
ing effects with other variables prevented the identification of such changes 
in group means. One practical implication of these findings is the importance 
of conducting supervisory training from the top down. To train foremen 
who will then resume their duties in a leadership climate inconsistent with 
their training may be worse than useless. 
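
How individual shifts can be large while group means remain unchanged may
be seen in a small numerical sketch. The Python fragment below uses
invented "Consideration" scores for five foremen in each group; the scores
and the function name are hypothetical and are not data from the
investigation described above.

```python
from statistics import mean


def summarize_shifts(first, second):
    """Return (mean signed shift, mean absolute shift) between two occasions."""
    shifts = [s - f for f, s in zip(first, second)]
    return mean(shifts), mean(abs(x) for x in shifts)


# Hypothetical scores on the two occasions, invented for illustration only.
trained_first = [50, 50, 50, 50, 50]
trained_second = [60, 40, 58, 42, 50]
control_first = [50, 50, 50, 50, 50]
control_second = [52, 48, 51, 49, 50]

trained_mean, trained_abs = summarize_shifts(trained_first, trained_second)
control_mean, control_abs = summarize_shifts(control_first, control_second)

print(f"Trained: mean shift {trained_mean:.1f}, mean absolute shift {trained_abs:.1f}")
print(f"Control: mean shift {control_mean:.1f}, mean absolute shift {control_abs:.1f}")
```

In this invented example both groups show a mean shift of zero, yet the
trained foremen moved much farther from their initial scores, some upward
and some downward. A comparison of group means alone would conceal such
changes.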


AUTOMATED TRAINING


An automated trainer usually represents a whole training program, in which 
the instructor's role is minimized. In this regard, such trainers differ from 
training aids and other equipment used by the instructor for demonstration 
purposes. The design of an automatic trainer should incorporate principles of 
efficient learning, such as those discussed in the preceding section. Among the 
chief advantages of available trainers are their provision for active learner 
participation and for immediate feedback. On the other hand, insofar as a 
trainer may differ from the job itself either in the operations required or in 
the surrounding working conditions, transfer of training to the job situation 
may be reduced. Even tasks that are superficially alike may not be interchange- 
able for training purposes. In an investigation of typing, for instance, transfer 
of training from electric to manual typewriters averaged only 58 per cent (1). 
The extent of transfer needs to be empirically demonstrated for each trainer. 

Available automatic trainers are of three major types: training devices, 
simulators, and teaching machines. Each will be briefly discussed and illus- 
trated in the following sections. 

Training Devices. Designed to develop essential job skills, training devices 
may isolate operations and alter their sequence to increase training effective- 
ness. Training devices vary in their complexity, degree of dependence upon 
an instructor, and closeness with which they reproduce the job situation. They 
also differ in the type of functions in which they afford training. Among the 
best-known training devices are those providing practice in motor skills. An 
example is the flight trainer used to train student pilots in instrument flying. 
One model of flight trainer is illustrated in Figure 29. Without leaving the 
ground, the student performs all necessary operations for specified flight ma- 
neuvers and receives immediate feedback on the effects of each action as 
though he were in actual flight. Several investigations have shown that such 
ground training transfers well to aircraft flying and can effect considerable 
saving in costly aircraft training time (15, 78). There is also evidence to sug- 
gest that simple types of trainers are about as effective for basic training as 
complex trainers that reproduce aircraft features more fully (77). 

A more specialized type of trainer, providing practice in visual tracking, is 
the flexible gunnery trainer (27). This device was used by the Air Force dur- 
ing World War II in training gunners to fire from a bomber at an attacking 
fighter aircraft. The trainer includes an actual sighting and gunnery station, 
which is used by the trainee in tracking and triggering. Attacking airplanes 
are shown by projection on a screen. As the trainee operates the equipment, 
he receives immediate knowledge of results regarding hits. A score based on 
time-on-target during each attack can also be obtained. Other trainers have 
been designed for bombardiers, navigators, radar operators, submarine crews, 
and other military specialists. Some utilize techniques that are applicable to 
civilian jobs. Research on these devices is thus of more general interest than 
their military uses suggest. 

Training devices may also be used for conceptual training. This application 
can be best illustrated by the devices developed for training technicians in 
the maintenance of complex electronic equipment (22, ch. 12). Job analyses 
indicate that such a technician needs to learn a variety of facts, procedures, 
and concepts. He must be able to identify and locate many objects and parts 
in the equipment. He must also learn to follow many fixed sequences of opera- 
tions. In some of these identifications and actions, he must rely on memory; in 
others, he uses charts, guides, and checklists with which he must be familiar. 
Even more important is his mastery of concepts and rules, which provide a 
basis for problem-solving activities that cannot be specified in advance. In 
the same connection, he must acquire some understanding of the principles 
and functioning of electronic circuits, as well as a knowledge of what the 
specific equipment is designed to do. 

Fig. 29 A flight trainer used to provide basic training in instrument 
flying. (Courtesy of Link Division, General Precision Inc.) 

Figure 30 shows one training device developed by the Air Force to train 
technicians in troubleshooting electronic equipment (19). On the front panel 
are reproduced in miniature the various components, controls, and check 
points of a complex electronic system. A typical malfunction can be set into 
the system by operating one of the many switches on the side panel. The 
trainee must then manipulate controls and carry out various checking opera- 
tions to trace the source of the malfunction. Use of this trainer requires an in- 
structor to inform and guide the student when he makes illogical or unneces- 
sary checks. 

The combination of equipment illustrated in Figure 31 includes both an 
equipment-component mock-up (on the left) to be manipulated by the trainee 
and a microfilm projector (on the right) which provides instructions (41). 
By presenting problems and auxiliary information in this form, such a trainer 
permits greater flexibility than is possible with the type of trainer illustrated 
in Figure 30. A single device can thus be used to provide training for a wide 
scope of operations. The results of the trainee's manipulations may also be 
fed back into the programming of the microfilm projector. Hence the instruc- 
tions can be varied in accordance with the trainee's own previous responses. 

Fig. 30 A training device for training technicians in troubleshooting 
electronic equipment. (See French and Martin, 19; photograph by 
Maintenance Laboratory, Air Force Personnel and Training Research Center.) 

A high degree of automation may also be achieved by constructing the 
training device itself so that it provides performance guidance, feedback, and 
recording (6; 8; 22, ch. 12). Written instructions may be presented by means 
of a roll chart. The system may be so designed that it "freezes" when the 
trainee makes an error and will not “unfreeze” until the correct move is made. 
Similarly, signal lights may flash on to inform the trainee that he has followed 
an incorrect or illogical procedure. 
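
The "freeze" behavior described above can be pictured as a small state machine. The following Python sketch is purely illustrative and is not drawn from the cited devices: the class name, the move names, and the feedback strings are all assumptions. It shows a trainer that holds the trainee at the current step after an error, releases him only when the correct move is made, and keeps a record of the errors.

```python
from dataclasses import dataclass, field


@dataclass
class FreezingTrainer:
    """Toy model of a trainer that locks up after an incorrect move.

    `procedure` is the fixed sequence of correct moves (hypothetical names).
    """
    procedure: list[str]
    position: int = 0            # index of the next expected move
    frozen: bool = False         # True after an error, until the correct move
    errors: list[tuple[int, str]] = field(default_factory=list)

    def attempt(self, move: str) -> str:
        """Apply one trainee move and return immediate feedback."""
        if self.position >= len(self.procedure):
            return "procedure complete"
        if move == self.procedure[self.position]:
            self.frozen = False  # the correct move "unfreezes" the system
            self.position += 1
            return "ok"
        self.frozen = True       # stay on the same step until corrected
        self.errors.append((self.position, move))
        return "frozen: incorrect move"


# Example run with an assumed three-step checking procedure.
trainer = FreezingTrainer(["power_on", "check_fuse", "read_meter"])
for move in ["power_on", "read_meter", "check_fuse", "read_meter"]:
    print(move, "->", trainer.attempt(move))
```

The error list stands in for the recording function mentioned in the text; an actual device would of course register errors mechanically or electrically rather than in software.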

Simulators. The distinction between trainers and simulators is not a sharp 
one. Some of the training devices discussed in the preceding section could be 
classified as simulators. Essentially simulators reproduce job characteristics 
more closely than do trainers. Their design emphasizes realism in equipment 
and operations. As a result, simulators are highly specific. For example, flight 
simulators such as that illustrated in Figure 32 are usually built for each new 
major aircraft. These simulators reproduce with a high degree of fidelity the 
instrument panels in the cockpit, the "feel" of the controls, the sound of the 
engines, cockpit motion, and any other characteristics perceptible to the 
trainees. In the simulator shown in Figure 32, the runway is reproduced on 
closed-circuit television in order to provide realistic practice in takeoffs and 
landings. The appropriate visual cues can be seen through the cockpit win- 
dow. 

Fig. 31 Troubleshooting trainer with microfilm projection device for 
presenting instructions. (See Lumsdaine, 41, p. 86; photograph by 
Maintenance Laboratory, Air Force Personnel and Training Research Center.) 

Simulators are generally utilized for “proficiency training,” that is, for ad- 
vanced stages of training or for the maintenance of a high level of job per- 
formance in experienced operators. Flight simulators are regularly employed 
by major commercial airlines as well as by the Air Force. One function of 
these simulators is to give experienced pilots periodic practice in handling 
emergencies which would rarely be encountered during ordinary operations. 
Another function is to introduce the pilot to a new type of airplane. Before 
practicing in the air, the experienced pilot may spend several hours operating 
the simulator for the new airplane. This simulator practice enables him to 
transfer the skills and concepts he has acquired in previous flight training and 
experience to the operation of the unfamiliar airplane. 

Simulators are available for several other types of jobs, although their use 
has been limited largely to the armed services. In industry, they have so far 
found little application outside of aviation. Unquestionably, their greatest 
value is in jobs where use of regular equipment for training purposes would 
be hazardous. Both simulators and training devices have proved especially 
useful in the preparation of astronauts for space flight (59). The procedures 
trainer illustrated in Figure 33 was employed in training astronaut crew 
members with actual spacecraft controls and displays in normal and emergency 
operating procedures. It is a fixed-base trainer providing instrument environ- 
ment only. The instructor's console permits the insertion of different problems 
and the simulation of ground facilities such as the launch control center and 
remote sites. 

Fig. 32 A flight simulator for advanced proficiency training. (Courtesy 
of Link Division, General Precision Inc.) 

Teaching Machines. Although any automatic instructional device could 
logically be called a teaching machine, the latter term has come to be associated 
with verbal learning (22, 42, 70). Such learning may range from rote learn- 
ing, as in spelling or foreign vocabulary, to complex conceptual learning, as 
in mathematics or psychology. The teaching machine need not be a machine 
in the popular sense of "hardware." Some do use apparatus to present stimuli 
and record responses. But others require only printed materials, such as a 
specially prepared book or even a set of mimeographed sheets. 

An important feature shared by different types of teaching machines is 
commonly known as programmed learning. Programming involves essentially 
the preparation of a series of teaching steps through which the learner is 
guided in a systematically established sequence. The sequence may be uni- 
form for all learners, or it may follow different routes, depending upon the 
learner’s own responses. Once the program, or instructional content, has been 
prepared, it can be presented to the learner through various media, ranging 
from programmed textbooks, through films or specially developed mechanical 
gadgets, to electronic computers. Although the mechanics of the process have 
received extensive publicity, it is the program that is of prime importance. 
Unfortunately, the development of programs has lagged far behind the de- 
velopment of hardware devices. 

Fig. 33 Astronaut John H. Glenn, Jr., using procedures trainer. (Courtesy 
of National Aeronautics and Space Administration.) 

From the standpoint of the psychology of learning, teaching machines have 
much in common with the previously discussed training devices. Both require 
active learner participation and provide immediate feedback after each re- 
sponse. Both allow for individual differences insofar as the individual may 
progress at his own pace. Because they deal with verbal learning, however, 
teaching machines usually present far greater complexities of programming 
than do training devices. 
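
To make concrete what is meant by a program whose sequence may be uniform or may follow different routes depending upon the learner's responses, the following Python sketch may be helpful. It is an illustration only, not a reproduction of any published program: the frame identifiers, the arithmetic items, and the function name `run_program` are all assumed for the example. Each frame names the frame that follows a correct response and the remedial frame that follows an error.

```python
# Each frame holds a prompt, the correct answer, and two possible routes.
# The content is hypothetical; a real program would have many more frames.
PROGRAM = {
    "f1": {"prompt": "2 + 3 = ?", "answer": "5",
           "right": "f2", "wrong": "f1_help"},
    "f1_help": {"prompt": "Start at 2 and count 3 more. 2 + 3 = ?",
                "answer": "5", "right": "f2", "wrong": "f1_help"},
    "f2": {"prompt": "5 + 4 = ?", "answer": "9",
           "right": None, "wrong": "f2"},
}


def run_program(program, start, responses):
    """Route a learner through the frames.

    Returns a list of (frame_id, was_correct) pairs, one per response,
    so that the route actually taken can be inspected afterward.
    """
    trace = []
    frame_id = start
    for response in responses:
        if frame_id is None:          # program finished
            break
        frame = program[frame_id]
        correct = response.strip() == frame["answer"]
        trace.append((frame_id, correct))   # immediate feedback per response
        frame_id = frame["right"] if correct else frame["wrong"]
    return trace


# A learner who errs on the first frame is routed through the remedial frame.
print(run_program(PROGRAM, "f1", ["6", "5", "9"]))
# A learner who answers correctly throughout follows the shorter route.
print(run_program(PROGRAM, "f1", ["5", "9"]))
```

If every frame's "wrong" route simply pointed back to the same frame, the sequence would be uniform for all learners; the remedial frame is what makes this small program a branching one.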

The possibilities of automated teaching have received widespread attention 
in education, where programs have already been developed in many areas of 
instruction from the preschool to the college level. Such automated teaching 
programs will be discussed more fully and illustrated in Chapter 18, dealing 
with the contributions of psychology to education. That industry is interested 
in the potentialities of teaching machines is suggested by the many confer- 
ences, seminars, institutes, and workshops on teaching machines conducted by 
various organizations, including the American Management Association. A 
discussion of teaching machines from the standpoint of industrial training 
directors can be found in the articles by Levine and Silvern (39) and in the 
book edited by Margulies and Eigen (52). Current bibliographies on the 
whole area of automated learning are published in each monthly issue of the 
journal AID (Auto-Instructional Devices), founded in April, 1961. 

Teaching machines have been incorporated into the ongoing training pro- 
grams of a number of companies in such varied fields as manufacturing, life 
insurance, public utilities, and retail selling. Little research has been done, 
however, to test the effectiveness of such procedures. One of the few published 
studies on this question compared the use of a programmed text with con- 
ventional classroom instruction in a course on a data-processing system given 
to IBM computer servicemen (32). The classes that used the programmed 
text required 24 per cent less classroom teaching time, reported less time spent 
in home study, and obtained significantly higher scores on a terminal exami- 
nation than did the control classes. 

The armed services have also been making increasing use of teaching ma- 
chines, along with other types of auto-instructional devices (42). In connection 
with the previously cited training procedures for electronic maintenance 
technicians, for example, two simple devices for verbal training were devel- 
oped (7; 9; 22, ch. 12). Both employed a multiple-choice technique for teach- 
ing factual elements and for “verbal pretraining” in the memorizing of short 
series of operations. Of particular interest is the possibility of combining sev- 
eral media into an instructorless and programmed course for group instruction. 
This approach is illustrated in a combined tape-recording, film, and workbook 
package designed by the Air Force to teach English to aviation personnel in 
foreign countries (22, ch. 13). Although suitable for group instruction, this 
course provided active learner participation through individual workbooks. 
The instructional materials were programmed as in other teaching machines. 
By demonstrating the adaptability of the teaching machine to group instruc- 
tion, this project opens up possibilities for the wider utilization of teaching- 
machine principles. 


MANAGEMENT DEVELOPMENT 


Management development takes many forms, from personal counseling to sub- 
sidized university courses (20, chs. 7 and 9; 55). In content it may range 
from reading skills to interpersonal attitudes. The manager needs thorough 
and continuing orientation regarding company policies; he needs supervisory 
skills in handling the many face-to-face situations encountered in his daily 
work; he needs knowledge of scientific management principles for effective 
planning and organizing; in some departments, such as engineering, he obvi- 
ously needs to keep up with the latest technical and scientific developments 
in his field; and he needs the imagination and breadth to take a comprehensive 
and long-range view of company functions. Not all these needs are equally 
strong at all levels. In general, supervisory functions assume more importance 
at the lower management levels. They play a particularly important part in 
the job of foreman, or first-line supervisor. At the higher or executive levels, 
planning, organizing, and creativity become increasingly more important. 
Among the many types of management training programs, three are of 
special interest. These are concerned with human-relations training, manage- 
ment games, and creativity training. The first has been chosen for discussion 
because it covers a major portion of current management training. More has 
undoubtedly been written about human-relations training than about any 
other phase of management development. The other two, on the other hand, 
represent no more than promising ventures into relatively unexplored territory. 

Human-relations Training. Employee opinion surveys indicate that the 
greatest felt need for supervisory training is in the area of human-relations 
skills and interpersonal attitudes (4, 14). Such results were obtained when 
workers were questioned about the training they believed their supervisor 
should have, as well as when foremen were queried about their own training 
needs. The principal techniques currently employed for human-relations train- 
ing include individual interviews, as well as various group procedures. What- 
ever the training technique, however, its effectiveness depends upon the 
leadership climate in which the supervisor must function. In other words, the 
supervisor's behavior toward his subordinates is itself affected by the behavior 
of his own supervisor toward him. The importance of this factor was demon- 
strated by the previously cited International Harvester study (17, 30). Similar 
results have been obtained in other industrial research in which leadership 
climate was investigated (see 47). 

The use of the problem-solving interview as an instrument of human-rela- 
tions training has been fully described by Maier (49). If the individual seeks 
help because of a personal problem or job difficulty, the interview or series of 
interviews conducted to solve the problem would usually be considered 
counseling. The appraisal interview, on the other hand, is either initiated by 
the individual's supervisor because of a perceived job difficulty or is held 
periodically as a regular supervisory procedure. Essentially, the object of all 
such interviews is to give the individual an opportunity to bring up, talk 
about, and work out solutions for his own problems. The interviewer serves 
chiefly as an “active listener.” His function is to create a permissive climate 
conducive to free discussion and constructive problem solving and to help 
clarify the individual's own feelings. Since this type of interview is patterned 
closely after the “nondirective interview” developed in clinical psychology, 
it will be discussed more fully in Chapter 14. Illustrations of its use in in- 
dustrial contexts can be found in Maier’s book (49). Problem-solving inter- 
views with top executives represent one of the growing functions of the con- 
sulting psychologist in industry (see, e.g., 24). 

In the course of solving his problems in the interview, the supervisor also 
learns how to use the interview as a supervisory tool with his own subordi- 
nates. This dual function runs through much of human-relations training. 
Such training is designed to give the individual, not a set of ready-made solu- 
tions or rules, but a method for solving interpersonal problems. It is also de- 
signed to induce the individual to become active in his own self-training. 
Another important objective is the development of sensitivity to the feelings 
and attitudes of others. Thus the supervisor must learn to respond to slight 
cues in the behavior of others which may suggest underlying reasons for what 
they say and do. 

All of these objectives are illustrated in the various supervisory training 
techniques that utilize group discussion and social interaction (20, ch. 7; 50; 
51). As used in this context, “discussion” means active individual participation 
and group interaction in reaching decisions. It does not refer to the question- 
and-answer type of discussion that often follows a formal lecture. The dis- 
cussion methods used in human-relations training provide direct practice in 
interpersonal relations, group problem solving, conference leadership, and other 
skills needed in a supervisory job. The participant has a chance to observe 
feelings and attitudes as manifested in his own and his associates’ behavior. 
He may discover that the same objective events are differently perceived by 
different persons. And he should learn to recognize and avoid snap judgments, 
pat solutions, and stereotypes in solving human-relations problems. 

Several special techniques and training aids have been developed to in- 
crease the effectiveness of discussion methods for these purposes. One is the 
case method, whereby the group is given specific cases to discuss. Each case 
provides a description of a realistic human-relations problem for which the 
group undertakes to work out a solution. The many collections of cases that 
have been published for this purpose bear witness to the popularity of the 
method (e.g., 10, 12, 18, 26, 29, 34, 36, 46, 50, 63, 67). A variant of the 
case method, designed to increase active participation by discussants, has been 
described as the "incident process" (63). In this method, the case is introduced 
by reporting a specific incident, about which the discussants then ask ques- 
tions to elicit background facts. When sufficient information has been accumu- 
lated, the discussants proceed to summarize the case, identify the problem, pro- 
pose courses of action, and evaluate the relative merits of the proposals. 

A much less structured approach is illustrated by a technique commonly 
designated as sensitivity training (56, 66). A group of about a dozen trainees 
spend a short period (usually a week) living in a relatively isolated place, 
with a minimum of outside contacts and distractions. The group leader under- 
takes in a variety of ways to induce an atmosphere of frank and uninhibited 
airing of problems and feelings in group sessions. In this context, the partici- 
pants are encouraged to study their own reactions and those of other group 
members as they occur. Having much in common with group psychotherapy 
(to be considered in Chapter 14), sensitivity training is designed to increase 
insight into oneself and others, promote acceptance of self and associates, and 
improve interpersonal behavior. 

Some sensitivity training groups include only a single person from each 
organization, on the assumption that greater freedom of expression is likely 
to be achieved among strangers. Under these conditions, however, the single 
participant may find it difficult to apply his newly learned approaches upon 
returning to his company. Moreover, important interpersonal issues may not 
emerge in a group of strangers and there is little opportunity for coming to 
grips with existing conflicts and interpersonal problems within each organiza- 
tion. For these reasons, an increasing number of practitioners of sensitivity 
training prefer to work with a “family group,” that is, a group of persons who 
actually work together in a particular company. 

Another technique, which can be combined with either the case method 
or sensitivity training, is role playing. First developed by Moreno (57) as a 
clinical and counseling procedure, role playing has been extensively employed 
in management development by Maier and others (50, 51). In its basic form, 
role playing requires individuals to assume designated roles and enact a 
human-relations incident. A typical case may be presented orally, in written 
instructions, on a film, or through a dramatized version with prepared script 
and dialogue. The prepared material stops at the point where the problem has 
fully developed. Beyond that point, the trainees must carry on in their own 
way, assuming the roles of workers, foreman, or whatever the case requires. 
Each individual, however, is instructed always to play himself, that is, to act 
as he would if placed in that situation. Other group members serve as ob- 
servers. Following the role playing, the entire group discusses what was done. 
The number of role players in different cases may vary from two to a dozen 
or more. Trainees take turns in assuming the roles of employees, supervisors, 
and observers. 

Another form of role playing worked out by Maier and his associates (50, 
51) is multiple role playing (MRP). This technique is suitable for use with 
large groups. First, the group is subdivided into subgroups of six persons, by 
requesting that every three persons in alternate rows turn around and meet 
with the three persons directly behind them.¹ By means of written instruc- 
tions, the six members of each group are then assigned roles to play in an 
ensuing decision-making discussion. Each group receives identical instructions. 
Besides affording every individual in a large group an opportunity at role 
playing, this method has the further advantage of generating data of intrinsic 
interest to the group. Thus at the end of the discussion the solutions reached 
by each subgroup can be tabulated and compared. Such solutions will usually 
vary because of differences in the personalities of group members. 

The MRP technique also lends itself to research on the effect of various 
factors on group decision making. One investigator, for example, wanted to 
see how training in discussion leadership would affect group behavior (48). 
About half of the MRP groups in this study consisted of persons who had 
attended a short, intensive course in discussion leadership; the other half con- 
sisted of persons who had received no such training. An analysis of outcomes 
showed that 50 per cent of the untrained groups, as compared with only 
4½ per cent of the trained, had failed to reach a decision that was acceptable 
to all group members. Differences in the nature of the solutions also reflected 
the influence of the preceding training. 

Other studies evaluating the effectiveness of human-relations training pro- 
cedures generally report significant improvement in attitudes, sensitivity, and 
problem-solving behavior when the trainees are tested in the training situation 
itself (e.g., 38, 72). Few studies have obtained any evidence of transfer of 
training to the job situation. As noted in an earlier section of this chapter, such 
follow-ups as have been conducted have failed to reveal any significant over-all 
change in job behavior. Situational factors, such as leadership climate, ap- 
parently determine the direction and extent of changes resulting from human- 
relations training. 

Management Games. Not all management decisions deal with inter- 
personal relations. It is in handling personnel problems that the previously 
described group problem-solving methods are likely to prove most effective. 
Decisions reached jointly by all persons concerned are more likely to fit all the 
idiosyncrasies of those particular individuals than would decisions reached by a 
single person. Such a joint decision will also tend to be more acceptable to the 
group than would a decision imposed from without, however reasonable the lat- 
ter decision may be when evaluated objectively. Executives, however, are called 
upon to make many other kinds of decisions, concerning other matters besides 
personnel relations. Often these decisions require specialized knowledge and 
skills that only a few members of the organization may possess. Reaching a 
correct decision in such areas is not merely a matter of choosing a course of 
action that satisfies all group members. Other procedures are required to enable 
the executive to balance different company objectives, to secure and evaluate 
relevant information, and to predict the probability of different outcomes. 
Special mathematical techniques, designated as "operations research," have 
been developed to systematize many aspects of this decision-making process. 

¹ This procedure resembles in several ways a method known as “Phillips 66,” whereby 
a large group is divided into committees of six, each of which discusses some assigned 
question for six minutes (62). 
In training executives to make this type of decision, the previously men- 
tioned case method has again proved useful. Several published collections of 
"cases" deal not only with personnel problems but also with many other types 
of management decisions in such areas as production, research, distribution, 
advertising, and financing (12, 46, 68). A more specialized technique that has 
aroused considerable interest utilizes management games, or simulation (3, 
13, 43). Gaming as a training procedure is not new, having been used for a 
long time by the armed services in their “war games.” The application of this 
concept to business, however, is more recent, dating from the late 1950s. In a 
management decision game, one or more teams, each representing a business 
firm, make a series of decisions controlling the operations of their firm during 
a specified period. The simulated time period may be as short as a week or as 
long as a year or more. A mathematical model of how the industry operates is 
then used to calculate the outcomes of each team's decisions. With simple 
models, such calculations can be performed manually (2, 28); with more com- 
plex models, a computer is required. The management game thus provides im- 
mediate feedback to the trainees. 
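
The general scheme of a management game (team decisions, a mathematical model of the industry, and calculated outcomes returned as feedback) can be suggested by a deliberately oversimplified Python sketch. Nothing in it is taken from the games cited: the demand rule, the unit cost, the market size, and all names are assumptions chosen only to show the mechanics of one simulated period.

```python
from dataclasses import dataclass


@dataclass
class Decision:
    price: float        # unit selling price chosen by the team (must be > 0)
    advertising: float  # advertising outlay for the period


UNIT_COST = 6.0      # assumed cost of producing one unit
MARKET_SIZE = 1000   # assumed total units demanded in the period


def play_period(decisions: dict[str, Decision]) -> dict[str, float]:
    """Compute each team's profit for one simulated period.

    Assumed model: a firm's share of the market rises with its
    advertising and falls with its price, relative to the other firms.
    """
    appeal = {
        team: (1.0 + d.advertising / 1000.0) / d.price
        for team, d in decisions.items()
    }
    total_appeal = sum(appeal.values())
    profits = {}
    for team, d in decisions.items():
        units_sold = MARKET_SIZE * appeal[team] / total_appeal
        profit = units_sold * (d.price - UNIT_COST) - d.advertising
        profits[team] = round(profit, 2)
    return profits


# Two teams submit their decisions; the model returns the outcomes.
results = play_period({
    "Team A": Decision(price=10.0, advertising=0.0),
    "Team B": Decision(price=8.0, advertising=200.0),
})
print(results)
```

In an actual game the outcomes of one period would be carried forward into the next, and the model would include many more decision variables; with a model as simple as this one the calculations could equally well be done by hand.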

The first management game was developed by the American Management 
Association for use in its executive training seminars (3, 43). Several other 
games have subsequently been developed for use in schools of business and in 
company training programs (see 13, 45, 79). They have even been played by 
mail or leased telephone wires by teams in different cities. 

In contrast to earlier, simpler games requiring 15 to 30 minutes for each 
move, a more complex game was developed in 1960 at the Carnegie Institute 
of Technology (13). The Carnegie game is played at the rate of one or two 
moves a week over a full academic year. Each move requires from two to three 
hours on the part of a 7- to 10-man team. Between 100 and 300 decisions must 
be made for each simulated month of company time. Many require long-range 
plans with regard to production, marketing, or financing. Decisions might have 
to be reached about such matters as conducting research to develop new prod- 
ucts, obtaining funds by issuing debentures or common stock, or putting up a 
new plant building. Information is provided by a variety of regular monthly 
and quarterly reports, as well as by other special reports requested by the team 
as needed. It is the object of this game not only to give the players an aware- 
ness of the complexity of the economic environment in which they must oper- 
ate and the interdependence of company decisions but also to train them to 
cope with such complexity through systematic analysis and effective problem- 
solving procedures. 

Creativity Training. Industry is constantly clamoring for new ideas—from 
a catchy trade name to the complex creations of modern engineering. In to- 
day’s rapidly changing world, industry needs creative and imaginative man- 
agement more than ever before. It is therefore not surprising to find that the 
development and facilitation of creative thinking have received some attention 
in management development programs, as well as in the training of scientists 
and engineers. 

Psychological research on creativity has concentrated chiefly on ways of 
identifying creative persons. It is being increasingly recognized, however, that 
creative talent should be not only identified but also developed. There is grow- 
ing interest in the conditions conducive to creative achievement, in the work 
methods followed by creative producers, and in procedures for developing cre- 
ative productivity (73). 

A widely publicized technique designed to favor original and creative think- 
ing is known as “brainstorming” (60). By this method, the individual segre- 
gates in time the production of ideas from their evaluation. It is argued that an 
evaluative set makes the individual too critical and inhibits the development of 
new ideas. Hence the instructions emphasize quantity rather than quality of 
ideas. Subjects are told to express any idea that occurs to them, however 
foolish or unsatisfactory it may seem. Any form of criticism is strictly taboo. 
Selection and evaluation of ideas occur in a completely separate, later stage. 
The hypothesis is that brainstorming will produce not only more ideas in gen- 
eral but also more ideas of high quality than will the more critical traditional 
procedure. 

Research on the effectiveness of brainstorming is still meager, but it tends to 
confirm this hypothesis (53, 54). The advantages of brainstorming may disap- 
pear, however, when the quality criterion gives little weight to originality (76). 
On the basis of rather casual evidence, it has been commonly accepted that 
brainstorming is more effective when carried out in groups than individually 
(60). In a well-designed experiment, on the other hand, individual perform- 
ance was found to excel group performance significantly in mean number of 
ideas produced, mean number of unique ideas, and quality of solutions (74). 
It is possible that group interaction may contribute more to the subsequent 
evaluation of ideas than to their initial production. The relative effectiveness 
of individual and group procedures may also vary with the nature of the prob- 
lem. There are still many unanswered questions about brainstorming. With 
further research, moreover, other techniques to facilitate creative thinking will 
undoubtedly emerge. 


SUMMARY 


Personnel development and training are closely related to personnel selection 
in the effective utilization of manpower. Although the psychology of learning 
has much to contribute to the improvement of training procedures, well- 
designed applied research in this area is still rare. Industrial training is a con- 
tinuing process that can benefit all employees, from beginner to experienced 
worker and from unskilled labor to top management. Training programs in 
industry serve a variety of functions, including orientation, development of 
job skills, technical and professional training, management development, and 
general education. Among the principal training media employed for these 
purposes are lectures, training manuals and other printed materials, films and 
television, training devices, job performance, group-discussion and social-inter- 
action methods, and problem-solving interviews. 

The development of a training program should include task analysis to 
determine what must be learned, development of training procedures, and ob- 
jective evaluation of the program. The choice of training media and procedures 
depends in part on what is to be taught and to whom. In developing training 
procedures, one should also be guided by the results of available training re- 
search, in connection with such questions as motivation, stress, practice, feed- 
back, part versus whole learning, and transfer of training. The effectiveness 
of a training program should be evaluated in terms of job criteria as well as 
training criteria. 

Recent developments in the automation of training procedures are illustrated 
by training devices, simulators, and teaching machines (or programmed learn- 
ing). The rapidly growing area of management development covers many 
types of training and utilizes a diversity of training techniques. Much of it is 
concerned with the development of human-relation skills and attitudes through 
such techniques as problem-solving interviews, group discussion, the case 
method, and role playing. In other aspects of management development, two 
techniques of special interest are management games (or simulation) and 
creativity training. 


Motivation and Morale 


Selection and training provide qualified personnel to do each job. But to get 
the job done, the individual must also want to work. The motives that impel 
people to work are complex, and the incentives for which they strive include 
more than the paycheck. Within the limits set by his qualifications, what the 
individual actually accomplishes may vary widely as a function of his attitudes 
toward the company, toward his immediate superior, toward his fellow work- 
ers, and toward other aspects of the job. Low morale can make a brilliant 
worker totally unproductive; high morale can lead ordinary men to well-nigh 
incredible feats. 

To be sure, employee selection is concerned not only with abilities but also 
with attitudinal and motivational factors. Similarly, training programs contrib- 
ute to the development of attitudes. Nevertheless, the nature of the work and 
the conditions under which it is performed may substantially affect employee 
motivation. It is chiefly in terms of these job variables that employee motiva- 
tion will be considered in the present chapter. 

Today's widespread concern with problems of industrial motivation and 
morale dates from the classic Hawthorne Studies, which first focused attention 
on the part played by employee attitudes in productivity. Following an exam- 
ination of these studies, we shall consider procedures for investigating em- 
ployee attitudes, together with typical findings. Since interpersonal relations 
are a major factor determining employee attitudes, the study of industrial mo- 
tivation and morale is almost co-extensive with what some writers call “indus- 
trial social psychology” (24, 76). Three broad topics in this field are super- 
vision, communication, and organization theory. Each will be treated in a 
separate section of this chapter. 


THE HAWTHORNE STUDIES 


The series of investigations that have come to be known as the “Hawthorne 
Studies” began in 1924 at the Hawthorne Works of the Western Electric Com- 
pany, in Chicago, and extended over 15 years (54, 57). The focus of these 
studies shifted toward human relations in 1927, when the research came under 
the general direction of Elton Mayo of the Harvard Business School. This 
change in viewpoint, however, was gradual and was not fully effected until 
several years later. The first studies were designed to investigate the effects 
of illumination, rest pauses, length of working days, and pay rates on produc- 
tivity. It soon became apparent, however, that output did not bear a simple 
relation to these factors. Some of the studies yielded apparently haphazard 
results that defied interpretation. In others, productivity rose steadily, regard- 
less of what experimental changes were introduced. Even when poorer condi- 
tions were reinstated in the course of some experiments, worker performance 
continued to improve instead of dropping back to the earlier level. 

An examination of the actual procedures followed in the course of these ex- 
periments reveals what would now be recognized as an astounding number of 
uncontrolled factors. In their efforts to secure cooperation, the investigators had 
made conditions as pleasant as possible for their subjects. Thus, in addition to 
their exposure to the systematically controlled experimental variables, the sub- 
jects were released from their customary close supervision, worked in a small 
group of congenial co-workers of their own choice, were assigned to a more 
cheerful and comfortable workroom, were allowed to converse more freely dur- 
ing work, received considerable attention from the experimenter and other 
company personnel, and were consulted about the changes that were intro- 
duced. 

Following their recognition of the role of supervisory practices and other 
factors influencing employee attitudes, the Hawthorne investigators embarked 
upon an intensive attitude survey through interviews of over 21,000 workers. 
These interviews began as highly structured inquiries into what each employee 
liked and what he disliked about working conditions, supervision, and the job 
itself. It was soon discovered, however, that employees often talked about “ir- 
relevant” topics which were usually of interest in their own right. As a result, 
the structured interviews were replaced by longer, unguided interviews in 
which the employee was allowed to bring up any problem he considered im- 
portant. This survey served several purposes. It was noted that most employees 
welcomed the opportunity to express their views and exhibited more favorable 
attitudes as a result of it. Specific complaints registered during the interviews 
were studied by management with a view to correcting the conditions that 
evoked them. Principally, however, the results of the interviews were used as 
a basis for the discussion of human-relations problems in the company’s super- 
visory training program. 

The interview survey also suggested other problems for investigation. It was 
found that the expressed satisfaction or dissatisfaction often bore little relation 
to the actual condition cited. One employee might be satisfied and another 
dissatisfied with the same condition, such as wages, hours of work, or specific 
features of the working environment. The meaning of these items for the in- 
dividual seemed to depend in part upon their value as status symbols, and this 
in turn often depended upon the attitudes of the group in which he worked. 
There was also evidence of voluntary restriction of output as a result of pres- 
sures from the work group. The formation of informal social groups within the 
organization and the influence they exert upon individuals’ attitudes and job 
performance thus became the subject of the next phase of the Hawthorne 
Studies. 

Accordingly, an intensive observational study was initiated with a small group 
of workers assigned to a special room. Data were gathered over a six-month 
period by two investigators: an observer stationed in the room who recorded 
verbal and other overt behavior; and an interviewer who explored individual 
attitudes, meanings, and values and sought information on the individual's 
personal history and his life outside the plant. Special attention was given to 
the existence of informal groupings among the men, the relation of these 
groupings to the formal supervisory organization of the department, and 
the types of social pressures whereby the group controlled individual work per- 
formance. 

The last stage of the Hawthorne Studies is represented by the installation of 
a plant-wide program of employee counseling. This program provided profes- 
sional counselors for problem-solving interviews with any employee having a 
problem to discuss. Its objectives were to help employees in the solution of 
their personal problems and to provide management with general guides for 
improving employee-relations and supervisory procedures. 

All this research activity, together with the original demonstration that em- 
ployee attitudes can counteract the effects of such factors as illumination and 
work schedules, did much to launch the "human-relations" movement in man- 
agement. Today it is widely recognized that employee attitudes are an impor- 
tant determiner of job performance and that interpersonal relations with su- 
pervisors and co-workers play a major part in the development of these atti- 
tudes. 

The Hawthorne Studies also served to highlight the need for controlling at- 
titudinal variables in the design of psychological experiments. The term 
"Hawthorne effect" is now commonly used to refer to the influence that par- 
ticipation in an experiment may have upon the subject's behavior. Insofar as 
being a subject in a particular experiment may in itself involve certain inter- 
personal relations with the research personnel, as well as associated changes in 
status, self concept, or environmental milieu, a control group should be exposed 
to the same conditions in order to separate their effects from those of the 
experimental variable. 


MEASUREMENT OF EMPLOYEE ATTITUDES 
AND MORALE 


Systematic surveys of employee attitudes are now an accepted management 
practice. Some of the uses of these surveys were already suggested by the Haw- 
thorne Studies: to indicate needed changes in policies, procedures, or facilities; 
to provide a realistic basis for supervisory selection and training programs; and 
to help in evaluating the effect of different factors upon job performance. At- 
titude surveys have also been utilized in more general research on employee 
motivation, to find out what people want in a job and how these wants vary in 
relation to age, sex, education, occupational level, or other worker character- 
istics. In such research, data are often gathered on a community-wide or nation- 
wide basis, rather than being limited to a single company or plant. 

Methods for Measuring Attitudes. The major procedures for gathering 
data on employee attitudes and morale include interviewing, opinion question- 
naires, attitude scales, projective and other indirect techniques, observational 
methods, and nominating techniques. A combination of two or more of these 
techniques is often desirable. To ensure frankness, the respondent's anonymity 
and the confidentiality of the data should be preserved, whatever technique is 
employed. 

Interviews may be conducted with every member of a small unit or with a 
representative sample of a larger unit, such as a plant or company. A special 
form is the exit interview, held with employees who are about to leave the 
company. Although ranging from highly standardized to completely unstruc- 
tured, most attitude-survey interviews are of the guided, or patterned, type 
(see Chapter 4). 

While more superficial than interviewing techniques, opinion question- 
naires make it possible to reach a large number of employees in a short time. 
Anonymity can be easily provided by having employees drop the question- 
naires in a ballot box or mail them directly to an outside survey agency. Typi- 
cally, opinion questionnaires include questions on specific items within such 
areas as working conditions, relation to fellow workers, supervisory relations, 
company policies, pay, employee benefits, and opportunity for advancement. 
Responses may be analyzed so as to identify major areas of dissatisfaction; to 
single out specific items, such as employee cafeteria or vacation policy, about 
which opinions may be outstandingly favorable or unfavorable; to compare the 
frequency of dissatisfaction in different departments or different kinds of jobs 
within the company; or to answer other questions pertinent to the local cir- 
cumstances. 

Like opinion questionnaires, attitude scales are suitable for mass administra- 
tion in written form. Rather than surveying attitudes toward a wide variety of 
specific items, however, such scales concentrate on measuring intensity of atti- 
tude in a single broad area, such as attitude toward the company or over-all 
job satisfaction. They consist of a series of statements whose scale values have 
been determined by procedures similar to those described in Chapter 4 for the 
construction of scaled checklists. Typical items for use in employee attitude 
measurement are given in Table 8. The statements are presented in random 


TABLE 8  Illustrative Statements for Use in an Employee Attitude Scale 
(Selected from Uhrbrock, 73) 

Scale 
Value   Statement 

10.4    I think this company treats its employees 
        better than any other company does. 
 8.9    A man can get ahead in this company if he tries. 
 7.4    On the whole, the company treats us 
        about as well as we deserve. 
 5.1    The workers put as much over on the company 
        as the company puts over on them. 
 2.1    You've got to have "pull" with certain people 
        around here to get ahead. 
 0.8    An honest man fails in this company. 


order and without the accompanying scale values. The respondent's attitude 
score is found by taking the mean scale value of all the statements he en- 
dorses. 
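
The scoring rule just described can be sketched in a few lines of Python. This is only an illustration: the scale values are a selection taken from Table 8, while the short statement labels, the function name, and the sample respondent are hypothetical and are not part of Uhrbrock's procedure.

```python
from statistics import mean

# Scale values for a selection of the Table 8 statements.
# The dictionary keys are hypothetical shorthand labels.
SCALE_VALUES = {
    "treats_employees_best": 10.4,
    "can_get_ahead": 8.9,
    "treats_us_as_deserved": 7.4,
    "need_pull": 2.1,
    "honest_man_fails": 0.8,
}


def attitude_score(endorsed):
    """Return the mean scale value of the statements a respondent endorses."""
    if not endorsed:
        raise ValueError("respondent endorsed no statements")
    return mean(SCALE_VALUES[label] for label in endorsed)


# Hypothetical respondent who endorses two moderately favorable statements.
score = attitude_score(["can_get_ahead", "treats_us_as_deserved"])
print(round(score, 2))
```

A respondent who endorsed only the two least favorable statements would receive a score near the bottom of the scale, which is how the mean places each person along the attitude continuum.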

In a variant of this technique (5), the statements are still selected in terms 
of scale values, but the individual is required to respond to each statement by 
marking one of five terms: Strongly Agree, Agree, Undecided, Disagree, 
Strongly Disagree. He thus obtains a score of 1 to 5 on each statement, and 
the sum of these values is taken as his total job-satisfaction index. This pro- 
cedure has the effect of providing finer differentiation of attitudes and raising 
reliability (5). 
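
A minimal sketch of this summated scoring follows. The text specifies only that each statement receives 1 to 5 points and that the points are summed; the particular point assignment, the reversal of points for unfavorably worded statements, and all names in the code are assumptions made for illustration.

```python
# Points for a favorably worded statement (assumed direction of scoring).
RESPONSE_POINTS = {
    "Strongly Agree": 5,
    "Agree": 4,
    "Undecided": 3,
    "Disagree": 2,
    "Strongly Disagree": 1,
}


def item_score(response, favorable=True):
    """Score one statement from 1 to 5.

    Assumption: for an unfavorably worded statement the points are
    reversed, so that a high score always means a favorable attitude.
    """
    points = RESPONSE_POINTS[response]
    return points if favorable else 6 - points


def job_satisfaction_index(answers):
    """Sum the item scores; answers is a list of (response, favorable) pairs."""
    return sum(item_score(response, favorable) for response, favorable in answers)


# Hypothetical respondent answering four statements.
answers = [
    ("Strongly Agree", True),
    ("Agree", True),
    ("Disagree", False),  # disagreeing with an unfavorable statement
    ("Undecided", True),
]
index = job_satisfaction_index(answers)
print(index)
```

Because every statement contributes a graded score rather than a simple endorsement, two respondents who would endorse the same statements can still receive different totals.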

Projective and other indirect techniques for investigating attitudes are rela- 
tively unstructured, allowing the respondent considerable freedom of expres- 
sion. Insofar as they are indirect, these techniques are believed to be less sub- 
ject to rationalization and other face-saving devices and hence more likely to 
reveal genuine attitudes. Several familiar projective techniques, such as word 
association, sentence completion, or picture interpretation, have been adapted 
for attitude measurement. Another example is provided by a survey conducted 
at General Motors and known as “My Job Contest” (16). Employees were in- 
vited to participate in a letter-writing contest on the topic, “My Job and Why 
I Like It,” for which 5,000 prizes were awarded. The content of the letters, 
which totaled nearly 150,000, was analyzed with reference to 58 “themes,” or 
factors associated with favorable job attitudes. 

Observational methods for investigating employee attitudes and morale have 
already been illustrated in the description of the Hawthorne Studies. Such 
techniques require the services of a trained observer, who may spend several 
months in a department. While maintaining friendly relations with the employ- 
ees, he should serve only as an observer. It should be made clear to the em- 
ployees that he has no administrative functions or authority. Although at first 
the presence of a stranger creates a strained and unnatural situation, these 
difficulties can usually be overcome with time. The technique is used especially 
in studying group structure and interpersonal reactions among employees. 
Qualitative data pertaining to attitudes toward company policies, supervisors, 
and other aspects of the job situation can also be obtained from the comments 
and general behavior of the employees. 

Another procedure for investigating group structure is the nominating tech- 
nique derived from sociometry (see Chapter 4). Each employee is asked to 
name the person in his group whom he would choose as the best worker, the 
most fair-minded, the one who would make the best supervisor, or the one who 
fits some other given designation. The results can then be plotted in a “socio- 
gram,” as illustrated in Figure 34. Each circle in these diagrams represents a 


Fig. 34  Sociograms of three groups: cohesive group without strong leader, 
cohesive group with strong leader, and poorly structured group with cliques. 
(Figures 6.3, 6.4, and 6.5 from Industrial psychology and its social 
foundations by Milton L. Blum. Harper & Row, Publishers, Incorporated, 1956.) 


person. If individual 1 names individual 2 as his choice, an arrow is drawn 
from circle 1 to circle 2. Mutual choices are indicated by two arrows pointing 
in opposite directions. Sociograms are useful in identifying informal leaders, 
cliques which may disrupt group unity, and other characteristics of group 
structure. 
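
The bookkeeping behind a sociogram can be sketched as follows. The group of five workers and their choices are hypothetical, and the function name is an assumption; the sketch simply records each choice as an arrow, counts the choices each person receives, and lists the mutual choices.

```python
from collections import Counter


def analyze_choices(choices):
    """Summarize nominations; choices maps each chooser to the person he names."""
    # Each (chooser, chosen) pair corresponds to one arrow in the sociogram.
    arrows = sorted(choices.items())
    # Number of arrows pointing at each person.
    received = Counter(choices.values())
    # A mutual choice exists when two persons name each other.
    mutual = sorted(
        {tuple(sorted((a, b))) for a, b in choices.items()
         if a != b and choices.get(b) == a}
    )
    return arrows, received, mutual


# Hypothetical nominations for "best supervisor" in a five-man group.
choices = {"A": "C", "B": "C", "C": "A", "D": "C", "E": "D"}
arrows, received, mutual = analyze_choices(choices)
most_chosen, votes = received.most_common(1)[0]
print(most_chosen, votes, mutual)
```

In this invented group, the person receiving the most arrows would be a candidate informal leader, while persons who receive no arrows at all would appear as isolates on the plotted diagram.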

Factors in Job Satisfaction. A number of attitude surveys have been con- 
cerned with what workers want from their jobs. An example is an opinion sur- 
vey of about 3,700 men and women applying for jobs with the Minneapolis 
Gas Company (37). Each applicant was given a list of 10 factors and was 
asked to rank them in order of their importance to him, a rank of 1 represent- 
ing the most important. The average ranks assigned by men and women are 
shown in Table 9. It will be noted that the men gave the highest ranks to 
security, advancement (opportunity for promotion), and type of work (in- 
teresting work that they like). Company (feeling of pride in the company), 
pay, congenial co-workers, and a considerate and fair supervisor received in- 
termediate ratings. Hours, working conditions, and employee benefits were 
regarded as least important. 


TABLE 9  Mean Ranks Assigned to Job Factors by Applicants 
(From Jurgensen, 37, p. 353) 

Job Factor            3,345 Men    378 Women 

Security                 3.3          4.6 
Advancement              3.6          4.8 
Type of work             3.7          2.8 
Company                  5.0          5.4 
Pay                      6.0          6.4 
Co-workers               6.0          5.4 
Supervisor               6.1          5.4 
Hours                    6.9          6.1 
Working conditions       7.1          5.8 
Benefits                 7.4          8.2 


With minor exceptions, this rank order agrees with those found in several 
other similar surveys (30, 75). It has been frequently pointed out that such 
rank orders differ in a number of ways from those obtained when executives 
are asked what they believe the workers want. As a rule, executives rank pay 
much higher than the workers do themselves¹ (3, ch. 4; 75, ch. 15). The 
interpretation of reported order of importance, however, is not entirely clear. 
For one thing, people may give rationalizations because they sound more ac- 
ceptable than their actual preferences. Some may not be fully aware of their 
own motives. It is interesting to note that, in a study of 40 retail-grocery-store 
workers, a projective technique (story completion) revealed greater emphasis 
on pay than did direct questioning of workers about their current sources of 
job satisfaction (26). 

Another approach to the question of job satisfaction is to have employees 
rate their attitudes toward different aspects of their present job, as well as their 


1 Much effort has been expended, too, in working out elaborate financial incentive 
plans. For surveys and critical evaluations of such plans, see Blum (3, ch. 3), Rothe 
(61), and Viteles (75, ch. 3). 


over-all job satisfaction (e.g., 1). Those factors that correlate highly with over- 
all job satisfaction can then be regarded as the most important for the workers, 
those that correlate low as the least important. For example, if attitude toward 
employee benefits correlates negligibly with over-all job satisfaction, it means 
that an individual's satisfaction or dissatisfaction with his company's policies 
in this regard contributes little or nothing to his job satisfaction. 
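
As a rough illustration of this correlational approach, the sketch below computes the product-moment correlation between ratings of each job aspect and ratings of over-all satisfaction, and then orders the aspects by the size of that correlation. All ratings are invented for six hypothetical employees, and the function and variable names are assumptions.

```python
from math import sqrt


def pearson_r(x, y):
    """Product-moment correlation between two equally long lists of ratings."""
    n = len(x)
    mean_x, mean_y = sum(x) / n, sum(y) / n
    sxy = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    sxx = sum((a - mean_x) ** 2 for a in x)
    syy = sum((b - mean_y) ** 2 for b in y)
    return sxy / sqrt(sxx * syy)


# Hypothetical 1-to-5 ratings from six employees.
overall = [1, 2, 3, 4, 5, 5]
aspects = {
    "type_of_work": [1, 2, 3, 4, 5, 4],
    "benefits": [3, 4, 2, 3, 2, 4],
}

# Correlate each aspect with over-all satisfaction and rank the aspects.
importance = {name: pearson_r(ratings, overall) for name, ratings in aspects.items()}
ranked = sorted(importance, key=importance.get, reverse=True)
for name in ranked:
    print(name, round(importance[name], 2))
```

With these invented figures, attitude toward type of work correlates highly with over-all satisfaction and attitude toward benefits correlates negligibly, which is the pattern the method would interpret as high and low importance respectively.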

The relative importance assigned to job factors has been found to vary some- 
what with the characteristics of the workers. In Table 9, certain differences can 
be seen between the mean ranks given by men and women. For the women, 
type of work stands out as the most important factor, benefits as the least im- 
portant, and all other factors are fairly close to each other. A number of surveys 
have likewise found differences associated with age, education, type of work, 
and job level. 

With regard to job level, lower-level workers tend to stress factors extrinsic 
to the work, such as security and pleasant relations with co-workers and super- 
visor, while higher-level workers put more emphasis on the opportunities for 
self-expression, interesting work, and leadership that the job itself provides (8; 
12; 75, ch. 20). Several investigators have interpreted these findings in terms of 
a theory of “need hierarchy,” first proposed by Maslow (52, 53). According to 
this theory, needs fall into a hierarchy of prepotency, beginning with basic 
physiological needs for air, food, and water and proceeding through such needs 
as those for safety (or security), belongingness, self-esteem, and independence, 
to the need for self-actualization (i.e., using one's capacities to the fullest ex- 
tent). When a need is satisfied, it no longer serves to motivate behavior, and 
the next higher need in the hierarchy emerges. Although this theory is con- 
sistent with observed job-level differences in sources of job satisfaction, it 
should be noted that the nature of the job itself may also account for some of 
these differences. If a job is so organized as to allow the worker little or no 
self-expression, he must seek satisfaction in factors extrinsic to the work. This 
situation can be seen in semiskilled factory jobs as well as in the more routine 
types of clerical work. 

Relation of Attitudes to Job Performance. The relation between atti- 
tudes and job performance is not a simple one. To be sure, absenteeism, tardi- 
ness, accident rate, and turnover tend to be consistently higher the poorer the 
reported job attitudes (4, 30, 60). Studies of productivity, however, have 
yielded inconsistent results (4, 30). Some of these discrepancies may arise 
from methodological differences among investigations. But there are other fac- 
tors that complicate the relationship itself. 

To understand the relationship between employee attitudes and productiv- 
ity, we must find out how productivity is related to the employee's own goals 
(4). For example, an individual strongly motivated to advance to a higher 
level in the company may produce more for this reason and at the same time 
be dissatisfied with his present job. Or acceptance by his co-workers may be 
of prime importance to an employee, and such acceptance may be contingent 
upon restriction of output to conform to a norm set by his group. This would 
be especially true in a cohesive group with high morale and strong feelings of 
loyalty. Under some circumstances, quantity and quality of output may bear a 
different relation to the individual’s goals, as when a skilled craftsman resists 
efforts to increase speed if he believes the quality of his work will suffer. For 
some persons, a high degree of expressed job satisfaction may mean no more 
than complacency, a condition not especially conducive to the exertion of great 
effort. 

It is apparent that a proper interpretation of the relation between employee 
attitudes and productivity requires complex experimental designs, comprising 
more variables than have usually been investigated. An example is provided by 
a study conducted by Katzell and his associates (10, 41) in 72 wholesale ware- 
houses of a pharmaceutical company. Comparison of the different warehouses 
showed that both productivity and average job satisfaction were related to such 
situational variables as plant size, community size, proportion of women work- 
ers, and unionization. The positive correlation between productivity and job 
satisfaction found in this study could be explained in terms of plant differences 
in these situational variables. 
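
One common way to express the idea that a correlation is "explained" by a third variable is the first-order partial correlation. The sketch below is offered only as an illustration of that idea; the text does not state which statistical procedure the investigators used, and the three correlations are invented values, not results from the warehouse study.

```python
from math import sqrt


def partial_r(r_xy, r_xz, r_yz):
    """Correlation between x and y with the third variable z held constant."""
    return (r_xy - r_xz * r_yz) / sqrt((1 - r_xz ** 2) * (1 - r_yz ** 2))


# Hypothetical correlations across plants:
# x = productivity, y = average job satisfaction, z = a situational variable.
r_prod_sat = 0.30
r_prod_situation = 0.60
r_sat_situation = 0.50

adjusted = partial_r(r_prod_sat, r_prod_situation, r_sat_situation)
print(round(adjusted, 2))
```

In this invented case the correlation between productivity and satisfaction disappears once the situational variable is held constant, which is what it would mean for the original correlation to be attributable to plant differences in that variable.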

An investigation by Herzberg and his co-workers (31) sheds further light 
upon the sources and effects of employee attitudes. Data were gathered 
through intensive interviews of about two hundred engineers and accountants 
employed by nine industrial plants in the vicinity of Pittsburgh. In each inter- 
view, the employee was asked to think of a time when he felt exceptionally 
good or exceptionally bad about his job and to describe what happened. An 
account of both a “good” and a “bad” situation was obtained from each person. 
The interviewer also inquired into the duration of the situation itself and of 
the attitudes it engendered, the factors leading to the satisfaction or dissatis- 
faction (including both objective events and the individual's perception of 
them), and the effects of the situation on the individual's feelings, health, job 
performance, interpersonal relations, and other behavior. 

An analysis of the content of these interviews yielded a wealth of specific 
data. In interpreting their findings, the authors differentiate between "satisfi- 
ers" and "dissatisfiers." Their results show that the satisfiers, producing favor- 
able job attitudes, usually pertain to the job itself. Predominant among such 
factors, in order of their frequency, were achievement, recognition, character- 
istics of the work, responsibility, and advancement. These factors center around 
opportunities for self-actualization and growth. Recognition was most effective 
when related to genuine achievement and progress. Otherwise, its effects 
tended to be trivial and short-lived. The dissatisfiers, on the other hand, more 
often referred to job context, as illustrated by ineffective or unfair company 
policies, incompetence of supervisor, poor interpersonal relations with super- 
visor, and unsatisfactory working conditions. Furthermore, dissatisfaction could 
be aroused by a wider variety of factors than could satisfaction. 

It might be noted, too, that salary appeared with intermediate frequency 
among both satisfiers and dissatisfiers. However, mention of salary nearly al- 
ways pertained to obtaining raises or failure to receive an expected raise. Thus 
salary was perceived either as an instance of unfairness in supervision and com- 
pany policy or as tangible recognition for achievement and as a sign of prog- 
ress. 

With regard to the effects of attitudes, the authors conclude that dissatisfiers 
(contextual factors) influence principally emotional adjustment, job turnover, 
absenteeism, and the like. But only the satisfiers (intrinsic job factors) can 
provide the positive motivation to greater productivity. Thus human-relations 
programs and fringe benefits can at best eliminate dissatisfaction, but they can- 
not provide motivation for better job performance, especially of a creative na- 
ture. This is a provocative hypothesis, which should be tested further, partic- 
ularly with other kinds of work and other job levels. It would be interesting, 
for instance, to repeat the study with such groups as salesmen, executives, 
office clerks, and assembly-line operators. There is evidence from other sources 
to suggest that persons who enter different occupations vary systematically in 
interests and other personality traits (see Chapter 17). These differences may 
very well be reflected in sources of job satisfaction and dissatisfaction. 


SUPERVISION 


Despite the many complexities that beset the study of employee attitudes, one 
fact that emerges clearly is the importance of supervisory practices. Attention 
has been focused particularly on the key role of the foreman or first-line super- 
visor in shaping employee attitudes (see, e.g., 21, 29). In the previously cited 
“My Job Contest,” for example, supervision ranked first in frequency among 
the 58 themes mentioned in the letters (16). Surveys of the sources of job dis- 
satisfaction have repeatedly shown the behavior of the immediate supervisor 
to be a major factor (30, 31, 75). Supervisory behavior has also been found 
to be significantly related to absenteeism, accidents, grievances, and turnover 
in different departments (20). It is interesting to note, too, that some investi- 
gators have found group productivity to be more closely related to the super- 
visor's attitudes and behavior than to employee attitudes (39, 40, 47). Meas- 
ures of employee attitudes may reflect attitudes on a variety of matters, some of 
which may be unrelated to productivity. Supervisory behavior, on the other 
hand, is more likely to have a direct bearing on productivity and to influence 
those employee attitudes that are more relevant to effective job performance. 

A series of investigations conducted at the Institute for Social Research of 
the University of Michigan has been concerned with the characteristics of suc- 
cessful supervisors (47). Over 70 studies were carried out in many different 
industries, as well as in hospitals, government agencies, and voluntary organiza- 
tions. Data were gathered at all management levels and on workers in a wide 
variety of jobs, from unskilled labor and clerical work to scientific research. The 
general design of these studies involved essentially a comparison of the super- 
visory practices followed in the "best" and "poorest" units of an organization. 
The criteria of administrative effectiveness used in identifying these units in- 
cluded productivity; job satisfaction; turnover, absenteeism, and similar indices; 
costs; scrap loss; and employee and managerial motivation. 

A major finding of the Michigan studies was that supervisors of the high- 
producing units were more often “employee-centered,” while supervisors of 
low-producing units were more often “job-centered” (Figure 35). By 
“employee-centered” the investigators mean specifically that the supervisor 
places the primary emphasis on the human problems of the workers; he is 
friendly and supportive toward the worker, trying to build and maintain the 
individual's sense of personal worth; and he provides only general supervision, 
allowing considerable individual freedom in carrying out the job. In contrast, 
the job-centered supervisor places primary emphasis on production itself, seeing 
that the workers use the prescribed methods and overseeing their work closely. 
The more effective supervisors were also found to play a more differentiated 
role. They spent less time performing the same functions as the workers and 
more time in motivating their subordinates, in keeping them informed about
policies, and in long-range planning (38).

Fig. 35 Relation between supervisory practices and group productivity.
[Figure compares the number of first-line supervisors who are job-centered
and employee-centered in high-producing and low-producing sections.]
(From Likert, 47, p. 7.)


The investigators recognized, of course, that such comparative studies do not 
in themselves permit an analysis of cause-and-effect relations. It is conceivable, 
for example, that low group productivity leads to closer supervision and more 
pressure on the part of the supervisor, rather than vice versa. Although most of 
the research in the Michigan series utilized a cross-sectional approach, there 
are some longitudinal data to support the hypothesis that supervisory practices 
influence productivity. In one company, managers of high-producing and of 
low-producing divisions were shifted. Regardless of the initial productivity level 
of the new division, these managers tended to retain their former orientation 
toward the employees. Productivity itself changed after the shift, the low- 
producing divisions improving and the high-producing divisions slipping under 
the new supervisors. Moreover, the favorable change occurred more rapidly 
than the unfavorable (47, p. 12). 

Another approach to supervision is illustrated by the Ohio State Leadership 
Studies, most of which were concerned with descriptive analyses of organiza- 
tional structure, executive functions, and supervisory behavior in many differ- 
ent settings (64). One of these studies, already cited in Chapter 5 in connec- 
tion with the evaluation of human-relations training, identified two major 
dimensions of supervisory behavior: "Consideration," characterized by a 
friendly, warm, considerate supervisory relationship, as contrasted with an im- 
personal and authoritarian relationship; and "Initiating Structure," pertaining 
to the extent to which the supervisor actively plans and directs group activi- 
ties oriented toward goal attainment (17, 67). 

These two dimensions will be recognized as bearing a general resemblance to 
the employee-centered and job-centered orientations identified by the Michigan 
group. In the Ohio State research, however, the two characteristics were found 
to be independent dimensions. In other words, an individual's rating on one is 
unrelated to his rating on the other. It was found further that supervisors rat- 
ing low on both traits proved to be weak and ineffective leaders, who were 
often bypassed by their own subordinates (20). Although the results varied
somewhat with the situation, in general the most effective leaders were above 
average in both Consideration and Initiating Structure (67). Supervisors 
emphasizing either dimension at the expense of the other tended to be less 
effective. It might be added that the Michigan investigators, too, observed that
the foremen in charge of high-producing units tended to be both employee-
centered and oriented toward high production goals (47, p. 8). But this re-
lationship is not clearly demonstrated or emphasized in the published accounts
of the Michigan research. 

Further research on the dimensions of Consideration and Initiating Struc- 
ture indicated that the relation between these supervisory characteristics and 
certain indices of employee morale may be curvilinear (19). Thus both em-
ployee grievances and turnover within different units of a manufacturing plant
were related to supervisors' ratings in these dimensions only at the extremes of 
low Consideration and high Structure. Beyond certain critical levels, increas- 
ing Consideration or decreasing Structure had no effect on grievance and turn- 
over rates. The same investigation revealed interaction effects between the two 
supervisory dimensions. Specifically, the relation between Structure and griev- 
ance or turnover rate differed among foremen who rated high, medium, or low 
in Consideration. 

It will be recalled from Chapter 5 that a follow-up study of the effects of 
supervisory training, conducted at the International Harvester Company, re- 
vealed the importance of "leadership climate" as represented by the behavior, 
attitudes, and expectations of the foreman's own supervisor. In a still later 
follow-up, it was found that management rated as most proficient those fore-
men whose scores (unknown to the raters) were high in Initiating Structure
and low in Consideration (18, p. 312). This occurred despite the fact that fore- 
men with this trait pattern had work groups with higher rates of accidents, 
absenteeism, and turnover than did those with other patterns. Apparently 
these foremen received high ratings because they fitted the stereotype of effec- 
tive leadership. The merit ratings were thus reinforcing supervisory practices 
that were inconsistent with those taught in the supervisory training course. 

Several other studies have demonstrated that supervisory practices must be 
viewed against the management context in which the supervisor operates. One 
relevant factor, for instance, is the amount of influence the supervisor can 
wield with his own superiors. A study (56) conducted in a large public 
utility showed that such supervisory behavior as "siding with employees" and
"social closeness to employees" tended to be positively correlated with em- 
ployee satisfaction if the supervisor had enough influence to have his recom- 
mendations accepted by higher management. For supervisors lacking such in- 
fluence, however, the correlations tended to be zero or negative. In such cases, 
the supervisory behavior probably served only to arouse in the employee ex-
pectations that remained unfulfilled and thus increased their feelings of frus-
tration.

It is evident that there is no one simple formula for improving supervision. 
Here as in all situations involving interpersonal relations, there is no set of 
rules for “getting along with people.” As Likert (46) has pointed out, effective 
supervision must be adaptive and relative. Procedures that work for one person 
or in one situation may be useless or harmful with another person or in a differ- 
ent situation. Because of differences in past experiences, workers vary in the 
way they perceive the same behavior on the part of a supervisor. Employees 
also vary in their values, goals, and expectations. Personality differences, too, 
may make one worker welcome additional responsibility and freedom from 
supervision, whereas another feels lost and insecure in the same situation. 

A second important factor is the personality and habitual behavior of the 
supervisor himself. Any sudden change in the supervisor's manner or proce-
dures, such as might occur after a supervisory training program, may be viewed
with suspicion and distrust by his subordinates. Moreover, supervisory practices 
that are inconsistent with the supervisor's personality and run counter to his 
deep-rooted behavior patterns are likely to be superficial and insincere. In this 
lies one of the chief fallacies of the popular books and high-pressure courses on 
how to get along with people. Such an approach to interpersonal relations leads 
to behavior that is contrived and deliberate, rather than spontaneous and forth- 
right. Not only is such behavior likely to repel one’s associates, but it is also 
questionable from the standpoint of the individual’s own mental health (36). 

A third factor is the leadership climate or managerial context within which 
the supervisor must function. Important as his job is, the first-line supervisor 
cannot operate in a vacuum. His over-all success as a supervisor, as well as the 
relative effectiveness of specific supervisory practices, depends upon the com- 
pany’s organizational structure, the attitudes of higher management, and other 
situational conditions. 

Finally, it must be recognized that for effective supervision an employee-
centered, human-relations point of view is not enough. This limitation has
been noted by some of the Michigan investigators, more clearly demonstrated
in some of the Ohio State studies, and vigorously expounded by Herzberg and 
his associates in the report of their previously cited investigation of employee 
attitudes (31). These investigators emphasized the importance of the sense of 
achievement that can be provided only by the work itself. Part of the super-
visor's role in this connection is to recognize good work and to reward it
selectively—rather than praising indiscriminately just because to do so is “good 
human relations”! The supervisor can also help by structuring and assigning 
jobs so as to give the individual more opportunity for creativity and self- 
actualization. What the supervisor can accomplish in this regard, of course, is 
limited by the organizational system within which he must work. This is just 
another example of the fact that supervisory practices must be integrated with 
organizational structure. We shall take a closer look at some of the problems of 
organizational structure in a later section of this chapter. 


COMMUNICATION 


Nature and Scope of Industrial Communication. With the increasing 
size of industrial organizations, communication assumes more and more im-
portance as a means of linking the various parts together, channeling incoming
information to the appropriate outlets, and ensuring coordinated action. In- 
dustrial communication may take many forms. It can be as personal as a 
luncheon conversation between two vice-presidents or as impersonal as an an- 
nouncement over the plant's public address system. It can be as succinct as a 
one-sentence bulletin-board notice or as detailed as a 100-page report, as formal 
as a stockholders’ meeting or as informal as the transmission of a rumor over 
the grapevine. A common and effective medium of communication is the face-
to-face interview in all its forms, such as performance-appraisal, problem-
solving, and counseling interviews. Group meetings are among the most widely
used communication techniques in all types of organizations. Company train- 
ing programs serve a communication function, as do employee attitude surveys. 
Suggestion systems are a traditional means of routing employee ideas to man-
agement.

Within an organization, communication may flow downward, as when 
policy statements, operating procedures, and the like, are transmitted from 
management to employees; upward, as illustrated by employee suggestions and 
grievances; and horizontally, or across units at the same level. The mere pre- 
sentation of material in oral or written form does not assure communication. 
The complete communication process comprises transmission, reception, com-
prehension, and acceptance or rejection. What is communicated includes not
only cognitive material, such as facts, procedures, policies, and ideas, but also 
emotional climate, feelings, and attitudes. 

We must bear in mind, too, that actions speak louder than words and are 
thus a part of the communication system. A raise, a promotion, or a reduction 
in job responsibility are likely to be perceived by the employee as evaluations 
of his job performance—and as such they will carry more weight than a verbal
communication. When supervisors in a large public utility were asked, “On
what basis do you judge your standing with your immediate superior?" many
more of their replies referred to actions than to verbal appraisals (47, p. 54).
Figure 36 shows the percentage who checked each answer.

That communication has an important influence on employee attitudes and 
morale is quite generally recognized (62). Empirical demonstrations of this 
influence, however, are more difficult to find. Some suggestive data are repro- 
duced in Figure 37. Based on a survey of white-collar workers in a large public 
utility, this graph shows a consistent relation between freedom of communica- 
tion and absenteeism in different departments (50). Apparently those depart- 
ments in which employees felt free to discuss their problems with the supervisor 
had fewer absences than those in which the atmosphere was less conducive to
free communication. Similar results were obtained with blue-collar workers in
the same company.

Fig. 36 Replies of supervisors to the question: "On what basis do you
judge your standing with your immediate superior?" (From Likert,
47, p. 54.)

Fig. 37 Relationship between absenteeism and supervisory communication
in different departments. (From Likert, 47, p. 18.)

The Communication Process. Industrial psychologists have investigated 
the communication process from many angles. Some have been concerned with 
the relative effectiveness of different media. In one such study (11), infor- 
mation was transmitted by five different methods in various departments of an 
industrial plant. Later, tests were administered to determine how much the 
individual employees had actually received and retained. The results are 
shown in Table 10. It will be noted that the combination of oral and written
communication gave the best results but that oral only was more effective 
than written only. Bulletin board and grapevine yielded the poorest results. 


TABLE 10 Mean Information Test Scores of Employees
Receiving Communication through Different Media
(From Dahle, 11, p. 245)

                                    Number of       Mean
Medium                              Employees       Test Score*
Combined oral and written           102             7.70
Oral only                            94             6.17
Written only                        109             4.91
Bulletin board                      115             3.72
Grapevine only (control group)      108             3.56

* All differences are significant at the 5 per cent level or better except that between the
last two means in the column.

Written and oral communications each have certain intrinsic advantages
that make them suitable for different purposes. Written communications are 
more accurate and provide a permanent record. They permit fuller coverage of 
detail and can be cited as an official, authoritative reference. Oral communica-
tions, on the other hand, are more personal, permit the give-and-take of two-
way communication, can be adapted to the individual listener, and can be re-
inforced by the appropriate tone of voice, facial expressions, and gestures.

With regard to written communication, extensive early research is available 
on the legibility of printed matter as a function of such typographical char- 
acteristics as style and size of type, amount of space between letters, color of 
print and of background, and length of lines (see 55). A number of studies 
have been more specifically concerned with the content and readability of such 
employee publications as handbooks, plant magazines, and company bulletins 
(6, 62). Using a measure known as the Flesch Index, to be discussed in Chap- 
ter 12, these studies have demonstrated that the large majority of employee 
publications are too difficult for employees to read. Rewriting them in a style 
better suited to the educational level of their readers would increase their 
effectiveness as communication media. The same techniques have been ap- 
plied to checking the readability of union-management agreements, with sim- 
ilar results (42, 70). 

Another approach has focused on the improvement of communication skills, 
including both the transmitting and the receiving of information. There has 
been considerable interest, for example, in the development of reading im- 
provement programs for executives (43). A more comprehensive and individu- 
alized type of training is represented by the “communication clinic,” which has 
been successfully employed in a number of organizations (63). Concerned with 
both psychological and linguistic aspects of communication, this approach pro- 
vides whatever remedial procedures are needed to increase the individual's 
effectiveness in reading, writing, and speech.

Considerable basic research has been done on the role of communication in 
modifying beliefs and attitudes (33-35, 59). Several studies have been con- 
cerned with factors influencing the persuasiveness of communication, such as 
order of presenting ideas, prestige of the communicator, and various character- 
istics of recipients. Other studies have investigated changes in both retention 
and acceptance of the communicated material over a period of time. Such re- 
search has broad implications, tying in with learning principles on the one 
hand and with the social psychology of propaganda and mass communication 
on the other. 

Communication Barriers and Breakdowns. Special attention has been 
given to the factors associated with communication failures (47, ch. 4).
Certain supervisory practices and attitudes may reduce the effectiveness of up- 
ward communication, as illustrated in Figure 38. It will be seen that, in those 
departments where the men felt the greatest “unreasonable pressure” for better 
performance, only a small percentage said they would take grievances and com- 
plaints first to their superiors. The less the pressure, the greater the percentage 
of men who reported that they would communicate first with their superiors in 
these matters. Similarly, a correlation of .73 was found between the extent of
unreasonable pressure felt by the men and the difficulty they reported in try-
ing to communicate ideas for improving operations to higher management
levels (47, p. 45).

Fig. 38 Relationship between supervisory practices and freedom of
upward communication. (From Likert, 47, p. 45.)


Attitudes of hostility and distrust not only may reduce the flow and accept- 
ance of information but also may lead to deliberate distortion of communica- 
tion at all organizational levels. Status relationships and organizational climate 
may likewise hamper the upward flow of communication (32). In some or- 
ganizations, superiors are told only what subordinates think they want to hear. 

A common cause of communication failures is to be found in the different 
frames of reference against which individuals may perceive a situation. It is 
well known that perception is selective and that each person tends to perceive 
what fits in with his own experiences, interests, and needs. This fact was 
demonstrated when 23 middle-management executives participating in a com-
pany training program were requested to identify the major problem facing a
company about which they had read (15). Although the instructions asked 
for the problem that should be dealt with first by a newly appointed company 
president—and thus emphasized a broad, company-wide perspective—the ex- 
ecutives tended to perceive chiefly problems in their own areas, such as sales, 
organizational structure and policy, or human relations. 

Differences in the way the same situation is perceived by persons at differ-
ent organizational levels are revealed when superiors and subordinates are ques-
tioned about the effectiveness of their intercommunications. Table 11 sum-
marizes results obtained at two management levels and with regard to both
downward and upward communication. The differences are conspicuous: 
Whereas 100 per cent of the top staff believed they always or nearly always 
told foremen in advance about changes, only 63 per cent of the foremen 
thought so. Similarly, 92 per cent of the foremen reported that they always 
or nearly always informed their men in advance about changes, but only 47 
per cent of the men were of the opinion that the foremen did so. The same
trends can be seen when the question deals with the frequency with which
subordinates’ ideas and opinions are obtained by superiors in solving job prob-
lems.

TABLE 11 Effectiveness of Communication as Perceived by
Superiors and Subordinates
(Adapted from Likert, 47, pp. 52, 53)

Do superiors tell subordinates in advance about changes that will affect
them or their work?

                          Top staff     Foremen       Foremen       Men say
                          says as       say about     say as        about
                          to own        top staff's   to own        foremen's
                          behavior      behavior      behavior      behavior
Always                    70%           27%           40%           22%
Nearly always             30            36            52            25
More often than not       -             18            2             13
Occasionally              -             15            5             28
Seldom                    -             4             1             12

Do superiors get subordinates’ ideas and opinions in solving job problems?

                          Top staff     Foremen       Foremen       Men say
                          says as       say about     say as        about
                          to own        top staff's   to own        foremen's
                          behavior      behavior      behavior      behavior
Always or almost always   70%           52%           73%           16%
Often                     25            17            23            23
Sometimes or seldom       5             31            4             61
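The percentages quoted in the text for advance notice of changes (100, 63, 92, and 47 per cent) can be recovered from Table 11 by adding the "Always" and "Nearly always" rows. The short Python sketch below is offered only as an arithmetic check; the column labels, the function names, and the treatment of the dashes in the table as zero are assumptions made for illustration and are not part of the original report.

```python
# Percentages from Table 11, first question (advance notice of changes).
# Each list follows the row order: Always, Nearly always,
# More often than not, Occasionally, Seldom.
# Assumption: the dashes in the top-staff column are read as 0 per cent.
ADVANCE_NOTICE = {
    "top staff, own behavior": [70, 30, 0, 0, 0],
    "foremen, about top staff": [27, 36, 18, 15, 4],
    "foremen, own behavior": [40, 52, 2, 5, 1],
    "men, about foremen": [22, 25, 13, 28, 12],
}


def always_or_nearly_always(column):
    """Add the first two response categories of one column."""
    return column[0] + column[1]


def perception_gap(superior_column, subordinate_column):
    """Difference, in percentage points, between the superiors' and the
    subordinates' reports of 'always or nearly always'."""
    return (always_or_nearly_always(superior_column)
            - always_or_nearly_always(subordinate_column))


# Combined percentage for each of the four columns.
summary = {label: always_or_nearly_always(column)
           for label, column in ADVANCE_NOTICE.items()}

# Gap between what each level says of itself and what the level below reports.
gap_top_staff_vs_foremen = perception_gap(
    ADVANCE_NOTICE["top staff, own behavior"],
    ADVANCE_NOTICE["foremen, about top staff"],
)
gap_foremen_vs_men = perception_gap(
    ADVANCE_NOTICE["foremen, own behavior"],
    ADVANCE_NOTICE["men, about foremen"],
)
```

Expressed this way, the discrepancy between superiors and subordinates is a difference in percentage points at each of the two levels, computed from the same four columns that the text discusses.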

Communication breakdowns often result from preconceived notions, preju-
dices, stereotypes, and strong emotional associations with certain words (see
27). A tendency to overgeneralize or to jump to conclusions may also seriously
retard communication. The “evaluative tendency” has been described as an- 
other barrier to effective communication (58). When we hear or read a state- 
ment made by another person, often our first reaction is to express agreement 
or disagreement with it, i.e., to evaluate it within our own frame of reference, 
rather than trying to understand it from the other person’s point of view. The 
attempt to remove this communication barrier underlies the nondirective or 
client-centered techniques developed by Carl R. Rogers and his associates for 
use in counseling and psychotherapy (see Chapter 14). These techniques are 
now being applied to more and more areas of interpersonal relations.

Communication Networks. The typical organization chart, showing who
reports to whom, may be regarded as a map of the organization's formal com- 
munication channels. That information often flows through other, informal 
channels even in a relatively rigid military organization is illustrated in Figure 
39. The routing of information through an organization has important im- 
plications for decision making. The decisions made by a top executive depend 
upon the information he receives. The larger the organization, the more likely 
it is that information is highly screened, condensed, and interpreted before it 
reaches the top management levels. Persons at lower organizational levels may 
thus influence decisions by their selective filtering of information. Of particular 
interest is the type of communication network illustrated in Figure 40, in
which all information going to a top executive must filter through an assistant. 
As a result, the assistant may become the one who really makes the decisions 
and runs the organization. Such a situation may not only limit organizational 
effectiveness but may also disrupt morale. 
Fig. 39 Formal and informal organization of a military unit. The broken
lines show with whom each person spent the most time in getting work
done. (From Stogdill, R. M., The sociometry of working relationships
in formal organizations, Sociometry, Vol. XII, 1949, page 282, J. L.
Moreno, M.D., Editor, Beacon House, publishers.)


A number of techniques have been employed to analyze the flow of com- 
munication in both large organizations and small groups (13). Asking individ- 
uals with whom they spend their time during working hours is one approach. 
Having a trained observer stationed in a workroom, as in the Hawthorne 
Studies, is another. Other techniques study the communications more directly. 
Thus in a “duty study” a record is made of all communications passing a par-
ticular spot, such as a given executive's desk, throughout the day. In a “cross-
sectional analysis” each person is asked what communication (if any) was in
progress at a specified point in time. 

Another technique, known as “ECCO analysis” (episodic communication
channels in organization), follows a specific unit of information throughout the 
communication network (13). In applying this method, for example, the in- 
vestigator might ask each individual whether he knows some particular item 
of information. If so, the respondent is further asked from whom and through 
what medium he learned of it and in what respects, if any, the information he 
received differed from that presented by the investigator. This method is espe- 
cially well suited for the study of grapevine and other informal communica- 
tion channels. Data can be analyzed in various ways to show, for instance, ex- 
tent and direction of information flow, distortion of facts in the process of 
transmission, and relative use of different media. 
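The kinds of tabulation just described can be illustrated with a short Python sketch. It is only a hedged illustration of how replies gathered in this way might be tallied: the respondents, sources, media, and field names below are entirely hypothetical and do not come from the studies cited, and the summary measures are one possible choice among many.

```python
from collections import Counter

# Hypothetical replies about one item of information. Each respondent states
# whether he knew the item and, if so, from whom and through what medium he
# learned it and whether his version differed from the investigator's.
replies = [
    {"respondent": "A", "knew": True, "source": "manager", "medium": "memo", "distorted": False},
    {"respondent": "B", "knew": True, "source": "A", "medium": "grapevine", "distorted": False},
    {"respondent": "C", "knew": True, "source": "B", "medium": "grapevine", "distorted": True},
    {"respondent": "D", "knew": True, "source": "manager", "medium": "meeting", "distorted": False},
    {"respondent": "E", "knew": False, "source": None, "medium": None, "distorted": False},
]


def summarize(replies):
    """Tally extent of flow, use of media, direction of flow, and distortion."""
    informed = [r for r in replies if r["knew"]]
    return {
        # Extent of information flow: share of respondents who knew the item.
        "share_informed": len(informed) / len(replies) if replies else 0.0,
        # Relative use of different media among those who knew it.
        "by_medium": Counter(r["medium"] for r in informed),
        # Direction of flow: how often each (source, receiver) link was reported.
        "links": Counter((r["source"], r["respondent"]) for r in informed),
        # Distortion: share of informed respondents whose version differed.
        "share_distorted": (sum(r["distorted"] for r in informed) / len(informed)
                            if informed else 0.0),
    }


result = summarize(replies)
```

Because each reply names both a source and a receiver, the same records can be read as the links of an informal network, which is what makes the method suitable for tracing grapevine channels.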

Still another approach is represented by laboratory studies of communication 
nets in small task-oriented groups (2, 44, 65, 72). Figure 41 shows some of the 
communication nets investigated in these studies. Subjects are seated around a 
table but are separated by radial partitions so that they cannot see each other 
and must communicate by passing notes through slots in the partitions. Each
group is assigned problems whose solution requires the pooling of information
by all group members. The experimenter sets up different communication nets
by opening some slots and closing others.

Fig. 40 A communication network in which B plays an important
filtering role. (Adapted from Leavitt, 45, p. 201.)

In the star pattern (Figure 41), all information must be sent to a central 
person, C, who transmits the answers back to each individual member. In the
chain, information is sent by the two peripheral members P, through the mid- 
dle men M, to C. In the circle, there is no centralized organization, informa- 
tion traveling back and forth among all members. With regard to speed and 
accuracy of problem solving, the star yielded the best results and the circle the 
poorest. However, circle nets had better morale and were better able to adapt 
to sudden and confusing changes of task. A leader and a stable form of or- 
ganization emerged most readily with the star pattern, more slowly with the 
chain, and failed to emerge with the circle. It is quite likely that different 
kinds of communication nets may be most effective for different purposes. The 
chief contribution of this series of studies was to point the way for future 
research on the effect of communication patterns on group performance and
morale.


Fig. 41 Three types of communication nets. (Adapted from Leavitt, 44.) 
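The structural difference among the three nets can be made concrete by counting, for each position, the number of links a message must cross to reach every other member. The Python sketch below is a minimal illustration under stated assumptions: it takes five-member groups, treats each open slot as a two-way link, and uses position labels (S1 to S4, P1, P2, M1, M2, V1 to V5) that are invented here. It is one simple way of expressing how centralized a position is, not a reconstruction of the measures used in the original experiments.

```python
from collections import deque


def build_net(links):
    """Turn a list of two-way links into an adjacency mapping."""
    adjacency = {}
    for a, b in links:
        adjacency.setdefault(a, set()).add(b)
        adjacency.setdefault(b, set()).add(a)
    return adjacency


def total_distance(adjacency, start):
    """Sum of shortest-path lengths from one position to all others,
    found by breadth-first search. Smaller totals mean a more central
    position."""
    dist = {start: 0}
    queue = deque([start])
    while queue:
        node = queue.popleft()
        for neighbor in adjacency[node]:
            if neighbor not in dist:
                dist[neighbor] = dist[node] + 1
                queue.append(neighbor)
    return sum(dist.values())


# Assumed five-member versions of the three nets described in the text.
NETS = {
    # Star: every member communicates only with the central person C.
    "star": build_net([("C", "S1"), ("C", "S2"), ("C", "S3"), ("C", "S4")]),
    # Chain: peripheral members P reach C through the middle men M.
    "chain": build_net([("P1", "M1"), ("M1", "C"), ("C", "M2"), ("M2", "P2")]),
    # Circle: each member is linked to two neighbors; no position is central.
    "circle": build_net([("V1", "V2"), ("V2", "V3"), ("V3", "V4"),
                         ("V4", "V5"), ("V5", "V1")]),
}

totals = {name: {member: total_distance(net, member) for member in net}
          for name, net in NETS.items()}
```

On this count the star contains one position that is much closer to everyone else than any other position is, the chain shows a graded difference from C to M to P, and in the circle all positions are equivalent, which agrees with the order in which a leader was reported to emerge.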
ORGANIZATION THEORY


Supervision, communication, and other specific managerial practices must be 
viewed against the broader framework of organizational structure. Essentially, 
the structure of an organization refers to the nature and interrelations of its 
units, that is, of the individuals and groups assigned to carry out its various 
functions. The links in an organizational chart indicate not only formal chan- 
nels of communication but also other organizational relations, such as lines of 
authority and influence, relative status, promotion ladders, work flow, and 
functional dependencies integrating the organization's division of labor. Or- 
ganization theory is concerned with the characteristics of organizational struc- 
ture and their relation to organizational effectiveness. In a broader sense, the 
term is also used to encompass all managerial practices, since these are so 
intimately bound up with organizational structure. 

Organizations have been described from many points of view and by repre- 
sentatives of diverse disciplines, including business administration, engineer- 
ing, economics, sociology, and psychology (25, 64). Some psychologists have 
utilized biological models to conceptualize the growth of organizations (23) or 
mathematical models to analyze their linkage systems (7). Others have fo- 
cused on qualitative descriptions of the "psychological climate" of organizations, 
using such concepts as permissiveness versus rigidity, cooperativeness versus 
competitiveness, autocratic versus democratic leadership, group conformity ver- 
sus individual freedom, and the like. Still others have employed the statistical 
techniques of item analysis to identify such organizational dimensions as de- 
gree of control exerted over members’ behavior, flexibility or informality of 
procedures, pleasantness of hedonic tone accompanying group membership, 
homogeneity of members in socially relevant characteristics, and extent of 
stratification into status hierarchies (28). 

This diversity of approaches is reflected in the varied proposals for improving 
organizational effectiveness. Among those of particular interest to psychology 
are employee participation in decision making, sociometry in work-group as- 
signments, and job design to maximize opportunities for individual responsibil- 
ity and initiative. 

Participation. One of the chief results of the human-relations movement 
in management has been a growing emphasis on employee participation in 
decision making (47-49, 69). There is some variation, however, in the nature 
and extent of participation that is advocated, the mechanics for implementing 
it, and the extent to which it involves group or individual responsibility. Some 
writers have pointed out that the optimum degree of employee participation is
a function of the characteristics of the employees, the supervisor, and the 
situation (68). 

Likert (47) typifies the group-dynamics approach in his advocacy of group- 
decision processes at all levels. To this end, he proposes a type of organiza- 
tional structure consisting of many overlapping groups that extend across ad- 
jacent management levels. Each foreman, for example, would participate in 
decision-making conferences with the men in his work unit and would also 
participate with other foremen and their supervisor in similar conferences at 
the next level. Thus the supervisor in one group is a subordinate in the next, 
and so on at successive levels. Horizontal as well as vertical linkages are built 
into this system. The emphasis on groups rather than on individuals is quite 
evident in this approach, as illustrated by Likert’s statement that “an organiza-
tion will function best when its personnel function not as individuals but as
members of highly effective work groups with high performance goals” (47,
p. 195). 

On the basis of one of the Michigan studies, Likert presents some evidence 
that participation of employees in decision making does not reduce the influ- 
ence exerted by superiors but actually increases the total amount of influence 
exerted by all members of the organization (47, pp. 179-181). This, he argues, 
follows from the fact that decisions in which subordinates participate are 
more likely to be effectively implemented. When superiors have the sole au- 
thority for making decisions, they exert more influence on the decision making 
itself but exert less influence on what actually goes on than when subordinates 
participate in the decisions. 

One of the most widely quoted studies of employee participation was con- 
ducted at the Harwood Manufacturing Company, a pajama factory (9). Cer- 
tain necessary changes in the design of the garments required new work 
assignments for the operators. As is usually the case, the workers had reacted 
with considerable resistance to previous changes. Even when their earnings 
are protected, workers resent having to learn new work methods. As a result, 
production drops, and grievances, absenteeism, and turnover increase. At the 
time of the investigation, four equated work groups were formed. One group 
of 13 served as a control, being merely informed of the change in the usual 
way. Two groups of 7 and 8 workers actively participated with management in 
planning the changes. A fourth group of 18 was represented by two members, 
who participated with management in making the changes and discussed de- 
velopments with the rest of the group. 

The results with regard to productivity are shown in Figure 42. The two 
groups with full participation (whose results have been averaged in the graph) 
showed less drop after the changeover, faster recovery, and higher ultimate 
level of output than the control group. The group with limited participation 
yielded intermediate results. The control group also had more grievances and 
turnover after the change than did the experimental groups. Several months 
after the original experiment, the control group was given full participation 
before undergoing another change of procedure. On this occasion, its produc- 
tivity record equaled that of the experimental groups. A much later report 
from the same company shows the successful introduction of extensive changes 
in production methods with the help of a company-wide program of employee 
participation (22). 

Although the results of the Harwood studies are impressive, it should be 
noted that in such field investigations, as in much applied research, it is diffi- 
cult to identify causal factors with certainty. It is possible that at least some of 
the group differences in productivity in the first study may have resulted from 
other differences between the groups, such as the nature of the training re- 
ceived (see 75, pp. 168-169). Nevertheless, the findings of these studies are 
in line with those of other research indicating that participation in decision 
making tends to improve employee morale and job performance (47, ch. 3; 
75, ch. 9). 

Fig. 42 Group participation and productivity. (Adapted from 
Coch and French, 9, p. 522.) 


Sociometry. Sociometric methods have been used in industry to assign 
individuals to work groups, as well as in the military services to assemble crews 
and combat teams. The effect of sociometric work assignments on job 
performance was demonstrated in a study of 74 carpenters and bricklayers who 
had been working on a housing project for a period of 5 months or more (74). 
Each man was asked for his first, second, and third choice of work mates, and 
teams were formed on the basis of these preferences. Comparison of records for 
an 11-month period when these teams were in operation with the records for 
the preceding 9-month period showed statistically significant reductions in 
labor and material costs, as well as in turnover rate. The total saving in pro- 
duction costs amounted to 5 per cent. 

In this connection it might be noted that studies conducted both in industry 
and in the armed services have found high peer-group loyalty to be associated 
with better job attitudes, less tension and anxiety at work, and less absenteeism 
(47, ch. 3). The relation of group loyalty and cohesiveness to productivity is 
more complex. When the objectives of the group conflict with those of the 
organization, then productivity is likely to be lower the more cohesive the 
group. Voluntary restriction of output is a familiar example of this situation. 
But when supervisory relations and other organizational practices are such as to 
motivate the group toward high productivity, then group cohesiveness is posi- 
tively related to productivity (47, ch. 3). 

Studies on the “geography of production” (24), or the spatial arrangement 
of workers in groups, are also relevant. In an intensive investigation of as- 
sembly-line workers, for example, the number of contacts a worker had with 
his associates at work was found to be related to his expressed job satisfaction, 
as well as to turnover, grievances, and other indices of job adjustment (78). By 
its very nature, the assembly line tends to minimize such contacts. 

Further evidence of the importance of relations with co-workers was 
provided by a study of coal miners in England (71). When technological 
advances led to certain changes in mining procedures, separate shifts of men 
performing one function at a time replaced the small teams that had previously 
done the whole job. This change was followed by an increase in accidents, a 
decrease in production, and other indications of weakened group morale. Grad- 
ually, other means were found to restore the work-group structure. Where this 
was done, it was observed that accidents declined and production rose. Find- 
ings such as these suggest the effectiveness of organizational procedures that 
facilitate work contacts with congenial co-workers and encourage identification 
with a work group. 

Job Design. The process of job designing includes three phases: identifi- 
cation of individual tasks to be performed; specification of the method for 
performing each task; and combination of tasks into jobs to be assigned to 
individual workers. In the effort to reduce operating costs, industrial engineers 
have relied chiefly on the principle of specialization in job design. The objects 
of specialization are to minimize skill requirements and learning time for each 
job and to limit the number and variety of tasks to be performed by each 
worker. At the same time, there has been growing recognition of the adverse 
effects of such specialization on the worker, notably through the increase in 
monotony and the loss of a sense of personal achievement. Traditionally, 
however, industrial psychologists have accepted the content of each job as fixed by 
the production process and have tried to improve worker satisfaction through 
interpersonal relations and other factors extrinsic to the work. 

Today there is evidence of a growing interest in modifying job content it- 
self so as to provide greater work satisfaction (14, 31). It might be added that 
with the increasing automation of industrial processes there is less and less 
need for human operators to perform routine tasks and more and more need 
for skilled personnel to carry on the varied and demanding jobs of mainte- 
nance, troubleshooting, and equipment development. 

Several specific attempts to reverse the trend toward job specialization have 
proved promising. At a very simple level, job rotation may reduce monotony 
by having each worker perform different operations in a rotated order, rather 
than performing a single operation all the time. In group planning, a team of 
workers participates in specifying the content of various jobs to be performed 
by members of the group. This procedure will be recognized as typical of the 
previously discussed group-dynamics approach. Job enlargement tackles the 
problem more directly by designing the job in such a way that the individual 
performs a longer sequence and wider variety of operations. He may also be 
responsible for testing the quality of his work, setting up and maintaining 
equipment, and carrying out other related functions. And he may have 
control over the way the job is done. 

Some relevant data are provided by an investigation conducted in a company 
manufacturing a hospital appliance (51). The subjects were 29 unskilled 
women workers. Performance was compared under four conditions: (a) line 
design, in which a single operation was performed at each work station, pro- 
duction rate was paced by the conveyor line, and workers rotated among work 
stations every two hours; (b) group job design, in which the conveyor was 
eliminated and the workers set their own pace, but the job was otherwise un- 
changed; (c) individual job design No. 1, in which each worker performed 
all operations at her own station, controlled the sequence of assembly, 
procured supplies, and inspected her own product; and (d) individual job design 
No. 2, in which the procedure was the same as in individual job design No. 1, 
but the workers were back in the main production area—in parts b and c, the 
workers had been in an adjacent room. 

The principal effect of these changes in job design was an improvement in 
quality of work. This can be seen in Figure 43, showing the percentage of 
defective assemblies in successive lots under the four job conditions. Both of 
the two individual job designs yielded significantly fewer defective assemblies 
than did the original line job design. Quantity of output dropped at first, 
when conveyor pacing was eliminated, but after a few days of practice under 
conditions of individual job design, it rose to the line job level. It is interesting 
to note, too, that with individual job design individual differences in rate of 
work increased markedly, the better workers producing 30 to 40 per cent more 
than under line design. Workers’ attitudes were also found to be more favor- 
able with individual job design, in which each individual was personally re- 
sponsible for quality and quantity of production. 

These findings appear promising and are in line with the results of other 
studies on job enlargement (77, 79). In this particular experiment, however, 
one wonders to what extent a Hawthorne effect may have operated. It would 
have been desirable, for example, to observe the performance of the same 
workers during a fifth period in which they returned to a line job design. 

In their previously cited study on employee attitudes, Herzberg and his 
associates (31) emphasized the importance of structuring jobs so as to give the 
individual more control over the way he does his work and hence a stronger 
sense of personal achievement. Other spokesmen for the individual have 
urged—in their various ways—that he be protected against organizational pres- 
sures for conformity, against the anonymity and depersonalization of the as- 
sembly line, and even against an exaggerated emphasis on teamwork and group 
thinking. Undoubtedly many persons can function effectively and happily on 
the job as members of a closely knit team. But others function better as 
individual contributors. 

Is the individual to be rescued from the anonymity of the assembly line only 
to be plunged into the more intimate anonymity of the team? It need not be 
so. Job designing can be sufficiently comprehensive in its approach to encom- 
pass individual differences with respect to teamwork too. Some jobs can be 
performed better by a team, others by an individual. Some phases of the same 
job can be handled more effectively on a group basis, others on an individual 
basis. Some kinds of decisions can be made more judiciously by groups, others 
by individuals. Job designing should capitalize on these differences, not oblit- 
erate them. And personnel selection should take this aspect of jobs into account 
in matching persons with jobs. 


SUMMARY 


The current concern with human relations in industrial management dates 
from the classic Hawthorne Studies, which focused attention upon the relation 
of job performance to employee attitudes and morale. These studies also high- 
lighted the importance of controlling attitudinal factors in the design of psy- 
chological experiments. 

The principal procedures used in gathering data on employee motivation, 
attitudes, and morale include interviews, opinion questionnaires, attitude 
scales, projective and other indirect techniques, observational methods, and 
nominating techniques. Among the factors making for job satisfaction, workers 
generally rate security, opportunities for advancement, interest in the work, 
feeling of pride in the company, and relations with co-workers and supervisors 
as high as pay or higher. The relative weight given to these factors, however, 
varies somewhat with the age, sex, and education of the workers, as well as 
with type of work and job level. Favorable employee attitudes are associated 
with a lower accident rate and with less absenteeism, tardiness, and job turn- 
over. The relation between attitudes and productivity, however, is more com- 
plex and depends upon other individual and situational variables. There is 
also some evidence to suggest that favorable job attitudes more often arise from 
characteristics of the work itself, while unfavorable job attitudes more often 
arise from the job context, and that the former bear a more direct relation to 
productivity than do the latter. 

The attitudes and morale of any group are deeply influenced by the super- 
visory practices, communication system, and organizational structure under 
which it operates. Research on supervisory and leadership behavior has identi- 
fied the independent factors of Consideration and Initiating Structure, both of 
which are related to employee attitudes and productivity. 

Industrial communication takes many forms. It may be written or oral, 
formal or informal, individual or group, factual or attitudinal. It may flow 
upward, downward, or horizontally. Research has been concerned with such 
problems as the relative merits of different media; legibility, content, and 
readability of written communications; the development of communication skills; 
factors influencing the persuasiveness of communication; sources of 
communication barriers and breakdowns; and the nature and effectiveness of 
communication networks. 

Organizational structures have been analyzed from many points of view, and 
this diversity is reflected in the proposals for improving organizational effec- 
tiveness. Although controlled experiments pertaining to organizational theory 
are few, some research is available on the effectiveness of such organizational 
practices as employee participation in decision making, sociometric work-group 
assignments, and the redesign of jobs with reference to attitudinal factors. 
While all these approaches are promising, it must be borne in mind that there 
is no single organizational formula, management procedure, or supervisory 
practice that fits all jobs and all individuals. 
Engineering Psychology 


Work Methods 


In a broad sense, engineering psychology utilizes knowledge of human be- 
havior in the development of effective working procedures and equipment. 
The three chapters dealing with this topic will consider in turn how the work 
is done, where it is done, and with what. The present chapter, concerned with 
work methods, will begin with an examination of the nature of work and 
fatigue and the problem of monotony in industrial work. We shall then in- 
quire into work schedules, or the distribution of work in time. This will be 
followed by a discussion of time-and-motion study and of the more compre- 
hensive approach represented by methods improvement. Finally, we shall dis- 
cuss accident prevention and industrial safety as one aspect of methods im- 
provement, as well as in their broader research implications. 

The second chapter in this section is concerned with the working environ- 
ment. In it will be considered the effects of such factors as illumination, ven- 
tilation, and noise. The third chapter deals with the design and arrangement of 
equipment and with the more inclusive problem of developing effective man- 
machine systems. 


WORK AND FATIGUE 


The principles of efficient work apply to any situation involving human per- 
formance. Although usually associated with industrial work, they are equally 
relevant to activities performed in offices, classrooms, homes, or military situ- 
ations. Basically, the concept of efficiency refers to the ratio of output to input. 
A machine that requires 50 units of power to produce 10 units of work is more 
efficient than one requiring 70 units of power to produce the same 10 units of 
work. When the concept is extended to human work, many more factors need 
to be considered, both on the input and on the output side. Thus, in addition 
to the time required to perform a job, we must take into account the worker's 
energy expenditure, his feelings of strain and effort, and the emotional costs 
of persistent worries, disagreeable obligations, or unpleasant interpersonal con- 
tacts. Similarly, a long-range comprehensive measure of output would include 
not only immediate quantity of production but also quality of original con- 
tributions and other creative achievements, errors and wastage, accidents, ab- 
senteeism, turnover, grievances, labor unrest and strife, physical and mental 
health of workers, and many other indirect or remote consequences of job per- 
formance. 
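
The machine comparison above can be worked through numerically. The following is only a minimal sketch of the output-to-input ratio as defined in the text; the function name `efficiency` and the labels "A" and "B" for the two machines are illustrative assumptions, not terms used in the original.

```python
def efficiency(output_units: float, input_units: float) -> float:
    """Return the ratio of output to input (the basic notion of efficiency)."""
    if input_units <= 0:
        raise ValueError("input_units must be positive")
    return output_units / input_units


# Figures taken from the example in the text: both machines yield 10 units
# of work, one from 50 units of power and the other from 70.
machine_a = efficiency(10, 50)  # 10 / 50 = 0.2
machine_b = efficiency(10, 70)  # 10 / 70, roughly 0.143

# The machine with the higher ratio is the more efficient one.
more_efficient = "A" if machine_a > machine_b else "B"
```

A single ratio of this kind covers only the machine case. For human work, as the paragraph notes, both the input side and the output side would have to include many additional measures.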

It is customary to portray changes in output in the course of work by means 
of a work curve. This curve usually shows quantity of production, but it may 
also be plotted in terms of errors or other measures of performance. Figure 44 
illustrates some of the concepts commonly used in describing work curves. 
Fig. 44 A hypothetical work curve, illustrating concepts used to 
describe performance changes. 


These include the initial “warming up” before peak productivity is reached, 
the eventual work decrement presumably indicating the onset of fatigue, and a 
possible “end spurt” as the end of the work period is anticipated. Random, 
momentary fluctuations are also characteristic of work curves. Not all the 
features shown in Figure 44, of course, need be found in any individual work 
curve. 

A major goal of human efficiency is the reduction of fatigue. Everyone 
“knows” what fatigue is, but definitions of it reveal widespread confusion (see 
7, 8). Sometimes the term is used to refer to feelings of tiredness, sometimes to 
a reduction in output, and sometimes to the physiological conditions resulting 
from continued activity. It might be argued that these three classes of phe- 
nomena are just different manifestations of the same general state of the or- 
ganism. But the difficulty is that they often disagree. Pronounced feelings of 
tiredness may be accompanied by no drop in output; output may decline in 
the absence of corresponding physiological changes; and output and feelings 
may remain unchanged for some time after the physiological onset of fatigue. 

Feelings of fatigue, moreover, may take many forms, depending upon the 
type of activity that evoked them. They may range from muscular soreness, 
localized stiffness, and aches to sleepiness, mental confusion, muscular ten- 
sions, and general weariness (74). Sometimes such feelings are difficult to 
distinguish from sheer boredom and loss of interest. The decline in output, 
too, may take different forms, such as decrease in speed of performance, de- 
terioration of quality, loss in originality, increase in errors and spoilage, rise 
in accident frequency, and greater variability of performance. Several of these 
output changes may occur together, of course. But, under certain circum- 
stances, output may continue unchanged in some respects while showing 
marked deterioration in others. 

It should likewise be noted that feelings of tiredness as well as output in- 
dices of fatigue are highly specific to the task. Transfer of fatigue to other 
tasks declines rapidly as task similarity decreases (9, 10). Even with very 
similar tasks, a sharp drop in output from continued performance of the one 
task is unlikely to produce an equally pronounced drop in the other. Change 
of activity, even though slight, will usually decrease the effects of fatigue. 

In the light of much that was said in the preceding chapter, it should be 
apparent that both feelings of fatigue and output decrement depend in part 
upon motivation and attitude. Everyone has undoubtedly had the experience of 
tiring quickly when performing a job he disliked but working for long stretches 
with no noticeable change in output or feelings at something that fired his 
enthusiasm. It is partly for this reason that many investigators have turned to 
physiological indices of fatigue. Among the measures used for this purpose 
are heart rate, blood pressure, oxygen consumption, blood composition, and 
electrical resistance of the skin. The increase in oxygen consumption occurring 
during continuous work is the basis of the respiration calorimeter used in many 
early studies of work methods. This technique requires the subject to wear a 
breathing mask and a bag to collect the expired air. Chemical analyses of 
samples of such air show the amount of oxygen consumed, which can be 
readily translated into calories of heat production (75). 

Another index that has been investigated quite extensively is the heart-rate 
recovery curve (16). The more strenuous the work, the longer the time 
required for the heart rate to return to a normal level. Figure 45 illustrates the 
difference in heart recovery rate following performance of a heavy manual 
job before and after it had been partially mechanized. The difference in physi- 
ological cost to the worker is evident in this graph. 

Fig. 45 Heart-rate recovery curves for a heavy manual job before and 
after partial mechanization. (From Brouha, 16, p. 15.) 


For many years psychologists have been trying to find behavioral indices of 
the onset of fatigue that would not be susceptible to the many extraneous 
variables affecting output. Performance on tests that are relatively neutral with 
regard to motivational factors and that are not a part of the task at which the 
individual has been working should provide a fairly "pure" index of behavioral 
impairment. It has generally been found, however, that short tests of any sort 
revealed little or no performance loss. Even when very tired, a person can 
usually muster his resources sufficiently to give a normal momentary perform- 
ance. 

Among the most promising behavioral indices of fatigue are tests of vigi- 
lance, which call for continued alertness (37, 65). Typically in such tests the 
individual must respond to irregularly appearing signals. Although he is re- 
quired to react only to very infrequent stimuli, he must watch for them over 
long periods. Fatigue is manifested chiefly in an increase in variability of 
performance. By this method it was shown, for instance, that, after a 4- to 6-hour 
flight, most experienced pilots show no appreciable fatigue but, after an 8- to 
10-hour flight, there is significant evidence of fatigue (37). Suggestive 
differences were also found in the fatigue effects of flying jet as compared with 
piston-engined aircraft and in the effects of day and night flights. This re- 
search indicated that continued work may affect high-grade performance, 
which requires precise timing and coordination of various activities, before 
the appearance of any physiological signs of fatigue. The behavioral signs of 
fatigue, furthermore, may be too slight to be manifested in gross output meas- 
ures. They may show up only in an occasional slip, a brief confusion of similar 
signals, or a slight interruption of the smooth rhythm of a skilled performance. 
These minor lapses, however, may have serious consequences in certain tasks, 
such as piloting a high-speed airplane. 

From all the research that has been conducted on fatigue, one point emerges 
clearly: Fatigue is not one but many things, and each should be investigated in 
its own right. Different indices, whether they be introspective reports, physio- 
logical changes, output measures, or special behavioral tests, all have their 
place and should be chosen with reference to the specific problem under in- 
vestigation. 

A frequent question pertains to the nature of “mental fatigue,” such as 
might result from solving mathematical problems or reading a difficult text- 
book. First it should be noted that the physiological effects of intellectual 
activity are very slight but of the same kind as those following physical work 
(see 8). It is likely that these slight changes are the result of the muscular 
responses that always accompany even the most "purely intellectual" work. For 
example, there are contractions of the eye muscles in reading; subvocal con- 
tractions of speech muscles during silent thought; postural reactions in the 
head and neck, as well as in the rest of the body; hand and arm movements 
in writing; and various muscular tensions from such tangential acts as gripping 
the pencil hard, setting the jaw, and frowning while trying to concentrate. 
Measures of muscle-action potentials taken while children or adults solve 
arithmetic problems without the use of paper and pencil show evidence of 
muscle activity which increases as the difficulty of the problems increases (22, 
43, 78). 

Among the other components of what is commonly called mental fatigue are 
general feelings of strain and effort. Boredom, drowsiness, and sleepiness are 
often included. Emotional reactions to frustration and conflict may also be an 
integral part of the experience. Frustration may arise from failure to solve a 
problem, indecision, inability to comprehend a difficult idea, and other 
stressful experiences accompanying mental work. 


MONOTONY AND BOREDOM 


In the preceding discussion of fatigue, frequent reference was made to 
monotony and boredom. These feelings are difficult to differentiate from fatigue, 
although they can be distinguished under certain circumstances. Thus an 
individual may become appreciably tired from a long spell of interesting and 
absorbing work which he at no time found boring. Or he may find a light, repetitive 
task highly boring but not feel tired from it. Nevertheless, boredom and 
fatigue are frequently associated. In a monotonous task we generally find 
greater output decrement and more energy expenditure than in a varied and 
absorbing task. Like feelings of fatigue, boredom is an unpleasant experience, 
often characterized by difficulty in keeping attention fixed on the task at hand. 
Part of the unpleasant emotional tone comes from a conflict between wanting 
to turn away to more attractive activities and realizing that one must continue 
with the dull task. 

Variability of performance has been found to be associated with the experi- 
ence of boredom. In one of the early investigations of monotony conducted by 
British psychologists for the Industrial Fatigue Research Board, output showed 
wider fluctuation for workers who reported they were bored than for those 
performing the same operations but reporting that they were not bored (93). 
This difference was observed both in the work curves of individual workers 
and in the standard deviations computed for individual productivity during 
short time units, such as 10-minute intervals throughout the day. Such 
variability, which is also characteristic of fatigue in general, may result from lapses 
of attention. It may also arise from a feedback process. As the worker becomes 
aware that production is dropping off, he may put forth more effort and 
thereby raise output until the next letdown starts another cycle. 
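
The variability measure described above, a standard deviation of output over short time units, can be sketched as follows. The output counts are hypothetical values invented for illustration and are not data from the study cited; the function name `output_variability` is likewise an assumption.

```python
from statistics import mean, pstdev

# Hypothetical units produced in six successive 10-minute intervals.
# These numbers are made up for illustration only.
bored_worker = [12, 9, 14, 8, 13, 7]
not_bored_worker = [11, 10, 11, 10, 11, 10]


def output_variability(counts):
    """Standard deviation of output across the short time units."""
    return pstdev(counts)


# Both hypothetical workers have the same mean output per interval,
# so they differ only in how widely output fluctuates.
same_mean = mean(bored_worker) == mean(not_bored_worker)

bored_sd = output_variability(bored_worker)
not_bored_sd = output_variability(not_bored_worker)
```

The two invented series are matched on mean output so that the comparison isolates fluctuation, which is the feature the text associates with reported boredom.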

The research conducted by the Industrial Fatigue Research Board also sug- 
gested that monotony produces a U-shaped work curve, with a production 
drop in the center of the total work period and an “end spurt” (see Figure 44) 
occurring as the period draws to a close (93, 94). It is reasonable to expect 
end spurts to be more common when output drops because of boredom than 
when it drops because of physical exhaustion. Nevertheless, it cannot be as- 
sumed that all monotonous work will produce a characteristic U-shaped curve, 
nor that all such curves indicate the presence of boredom. Many other factors 
affect the rate of production. Workers reporting severe feelings of boredom 
may show no appreciable output trend throughout the day. And the work 
curves of those who are bored may be indistinguishable from the work curves 
of those who are not (79). 

In industrial work, voluntary pacing of output during the day supersedes 
other factors that might influence the shape of the work curve. Not only is 
restriction of output for fear of rate cutting quite prevalent, but also, when no 
such fear exists, workers tend to form a concept of a proper day's work and 
adjust their pace accordingly as the end of the day approaches. The findings 
of the early British studies regarding monotony curves in industrial work, 
which have been widely quoted, are questionable owing to failure to control 
certain conditions and other methodological difficulties (see 79). 

There is some evidence to suggest that the effects of monotony may be 
greatest for tasks requiring an intermediate degree of attention (76, 93). With 
completely routine tasks demanding a minimum of attention, the individual is 
free to engage in reverie or, if the situation permits it, to talk with his 
co-workers. At the other extreme, a complex and varied task tends to hold the 
attention because of its intrinsic features. It is the semiautomatic task, requir- 
ing constant attention but providing little intrinsic interest, that is likely to 
produce the greatest strain. When performing such a task, the worker must re- 
peatedly resist the tendency to attend to other stimuli or to pursue other 
thoughts that have greater pulling power than the work at hand. 

It is apparent too that monotony cannot be understood solely in terms of 
task characteristics. The identical task may be boring in one set of circum- 
stances and not boring in another. Monotony depends upon the relation of 
task, surroundings, and individual. Other things being equal, of course, the 
more uniform and repetitive the task, the more monotonous it will be. In an 
intensive interview study of assembly-line operators in an automobile factory, 
the percentage of men who reported their work to be boring was 67 for those 
engaged in single-operation jobs, 56 for those whose jobs involved two to four 
operations, and only 30 for those whose jobs involved five or more operations 
(90). 

Nevertheless, the role of situational and individual variables has been re- 
peatedly demonstrated. Work may become more boring when performed un- 
der distracting conditions, because of the strain of fighting off the pull of com- 
peting stimuli. The distracting stimuli may be of many sorts, from other people 
talking or moving about to thoughts about last night's date or tomorrow's holi- 
day. On the other hand, with highly routine types of work, giving the indi- 
vidual an opportunity to converse with his neighbors may reduce monotony. 
In the next chapter we shall see that under certain circumstances even the 
introduction of noise may help to reduce monotony. Incentives also affect the 
degree of boredom experienced on a job, higher motivation tending to reduce 
monotony. 

Individuals differ widely in their susceptibility to monotony when perform- 
ing the same job. To some, even a highly repetitive job such as packing light 
bulbs is interesting; to others, a varied and complex job may be boring. Some 
relevant data, taken from one of the early British studies of industrial workers, 
are reproduced in Table 12. It will be noted that individuals differ not only in 
the absolute level of monotony reported but also in the relative monotony of 
different tasks. For worker C, for example, "staying boxes" is the least monotonous
of the five tasks; for worker F, it is the most monotonous. Similar
discrepancies can be found with regard to the other tasks. As these data suggest,
susceptibility to monotony cannot be regarded wholly as a general trait
but may exhibit considerable task specificity.


TABLE 12 Individual Differences in Extent of Boredom Experienced
When Performing Five Industrial Tasks*
(From Wyatt, Langdon, and Stock, 94, p. 15. By permission of the Controller
of Her Britannic Majesty's Stationery Office.)

                                          Worker
Task                    A     B     C     D     E     F     G     H     I     J
Packing (14 lb)        0.4   0.7   3.5   1.4   1.0   1.1   0.9   3.2   0.8   3.6
Packing (4 lb)          ?     ?     ?     ?     ?     ?     ?     ?     ?     ?
Staying boxes           ?     ?     ?     ?     ?    4.7   3.4   3.0   1.6   1.2
Making crackers         ?     ?     ?     ?     ?     ?     ?     ?     ?     ?
Bundling chocolates    4.2   4.6   4.3   4.8   3.2   4.4   4.1   4.5   3.8   4.0

* Scale extends from 0 to 5; higher ratings indicate more boredom. Each entry is the
mean of the ratings assigned to the individual's replies to 13 questions. Entries
shown as ? are illegible.

On the other hand, it is likely that some common variance could be identi- 
fied in individual susceptibility to monotony, especially in a more heterogene- 
ous sample than that employed in the above study. In a sample of workers 
engaged in routine, repetitive jobs, a certain amount of selection has occurred, 
since persons highly susceptible to monotony would tend to leave or not to 
enter such jobs in the first place. It has been suggested, for instance, that more 
intelligent persons are more likely to find a repetitive task boring. This may be 
true when a sufficiently wide range of intelligence is considered. Among in- 
dividuals employed on a particular job, however, most investigators have found 
no relation between intelligence and reported feelings of boredom (see 80). 
Within such groups, individual differences in susceptibility to boredom appear 
to be more closely associated with personality characteristics. 

In an investigation of 72 women sewing-machine operators, for instance, sus- 
ceptibility to monotony was found to be significantly related to certain other 
indications of discontent and restlessness (80). The workers reporting stronger 
feelings of boredom more often preferred active leisure activities, disliked regu- 
larity of daily routine, and indicated dissatisfaction with their personal and 
home life as well as their factory life than did those reporting less boredom. 
The former also tended more often to be under twenty years of age. The 
investigator describes those less susceptible to boredom as being relatively 
placid, contented with the existing state of affairs, and possibly rigid in their 
behavior. This type of investigation needs to be repeated with many different 
kinds of jobs and a wide variety of workers. It is possible that the personality 
correlates of susceptibility to monotony differ with the job. 

Many remedies have been proposed for monotony in industry. Each un- 
doubtedly has something to contribute in specific situations. The findings on 
individual differences in susceptibility to monotony suggest the importance of 
personnel selection and placement, with reference to relevant personality char- 
acteristics as well as job aptitudes. The study of sewing-machine operators 
cited above opens up the possibility of identifying the more susceptible in- 
dividuals in terms of biographical and interview data. 

Changes in the working conditions and working environment may also serve 
to reduce monotony. The introduction of rest pauses often relieves monotony 
as well as fatigue. Eliminating distractions in the case of work requiring constant
attention reduces conflict and strain. With more automatic jobs demanding little
attention, on the other hand, such activities as conversation with co-workers or
listening to music often help.

A third approach concentrates on making the work itself more varied or interesting.
Grouping repetitive work into broad units tends to give the worker 
a sense of accomplishment as each unit is completed. Work done in batches is 
usually less monotonous than work done with a continuous—and endless— 
succession of items. Films or plant tours showing where the individual's work 
fits into the total production picture or giving information about uses and 
characteristics of the product are also sometimes employed to increase interest. 
Finally, job rotation and job enlargement, discussed in the preceding chapter, 
represent more basic attacks on the problem of industrial monotony. 


WORK SCHEDULES 


Among the many conditions affecting efficiency of work is the distribution of 
working time. Several questions can be asked in this connection. What is the 
effect of introducing rest pauses during the work period, and what is the best 
way to distribute and utilize these rest pauses? How is productivity related to 
total length of workday and work week? Does the time of day when work is 
done affect output? 

Rest Pauses. Investigations conducted in industry have quite consistently 
demonstrated an increase in output following the introduction of regularly 
scheduled rest pauses. In the early studies of the Industrial Fatigue Research 
Board, relevant data were gathered on many kinds of industrial operations 
ranging from light to heavy work (61, 89). The results showed a rise in aver- 
age daily output, even though the employees spent less time working after the 
introduction of scheduled rest pauses. For example, in a group of women as- 
sembling bicycle chains, the introduction of a 5-minute rest at the end of each 
hour reduced total working time by 7 per cent while increasing average out- 
put by 13 per cent (89). Among the factors leading to such gains in produc- 
tivity are an increase in rate of work, reduction in errors, decrease in un- 
authorized rest pauses, and improved employee attitudes. 
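
The arithmetic behind the bicycle-chain example can be made explicit. What follows is only a minimal sketch and not part of the cited study: the function name is hypothetical, and it assumes that the 7 per cent and 13 per cent figures are both measured against the schedule without rest pauses.

```python
def relative_output_rate(output_change, time_change):
    """Ratio of new to old output per unit of working time.

    Both arguments are fractional changes, e.g. 0.13 for a 13 per cent
    rise and -0.07 for a 7 per cent reduction.
    """
    return (1 + output_change) / (1 + time_change)


# Bicycle-chain assemblers: output up 13 per cent, working time down 7 per cent.
ratio = relative_output_rate(0.13, -0.07)
print(f"Output per unit of working time rose by about {(ratio - 1) * 100:.1f} per cent")
```

Under these assumptions the rise in output per unit of time actually worked (about 21.5 per cent) is larger than the 13 per cent rise in daily output, since the larger output was produced in less working time.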

An investigation of a group of women comptometer operators in a United 
States government office provided systematic data on performance before and 
after the introduction of authorized rest periods (64). The rest schedule consisted
of an 8-minute pause in the morning and a 7-minute pause in the afternoon.
Owing to government regulations, the working day had to be lengthened 
by 15 minutes to make up for the time spent in scheduled rest periods. Thus, 
in this study, total working time remained unchanged. Records kept during 
two-week observation periods showed a significant decrease in time spent in 
unauthorized rest pauses and a significant rise in output following the introduction
of the scheduled rest pauses. 

The superiority of scheduled over unscheduled rest pauses probably results 
in part from better placement within the work period. For maximum effectiveness,
rest pauses should usually be introduced just before output begins to drop 
appreciably. If introduced too late, the rest period may be insufficient to overcome
the accumulated fatigue. If introduced too soon, on the other hand, the 
rest may simply interrupt performance at its peak of efficiency. Following such 
a rest period, the worker must go through another warming-up period (see 
Figure 44) before he again attains his previous efficiency level. 

The optimum distribution of rest pauses, as well as their individual duration, 
varies with the nature of the work and with the skill and experience of the 
worker. Heavy work or tasks demanding close concentration generally require 
more rest than lighter or more automatic work. Similarly the novice, who may 
be using less efficient procedures, tends to become fatigued more readily than 
does the skilled performer. The former may therefore require more frequent 
rest pauses than the latter. The optimal distribution of work and rest should 
be established by empirical tryouts in the particular situation. Daily output 
curves and data on the frequency of errors and accidents at different hours, as 
well as interviews with workers, help to determine the effective placement of 
rest pauses. With regard to method of using the rest periods, change of posture, 
as well as change of activity in general, is most beneficial. Refreshments and 
conversation with fellow workers improve attitude and relieve monotony, even 
if they serve no other functions. 

It should be noted that the improvement resulting from the introduction of 
scheduled rest pauses may not be immediately evident. In the previously cited 
study of bicycle-chain assemblers, for example, it was not until the new sched- 
ule had been in effect for six months that output manifested its full improve- 
ment (89). Similar delays have been observed in other industrial studies. It 
may require several months for the previously established work habits and pac- 
ing of performance to respond to improved working conditions of any sort. 
Such findings suggest the need for continued observations in industrial re- 
search. Conclusions based on short-range follow-ups may be misleading. 

When any change is made in working conditions, moreover, it is important 
to take into consideration its possible effects on employee attitudes. The Haw- 
thorne Studies discussed in Chapter 6 bear witness to this fact. On the one 
hand, the introduction of rest pauses without adequate communication regard- 
ing the purpose of the change and without employee participation in planning 
it may arouse suspicion and antagonism. The result may be a decline rather 
than a rise in output. On the other hand, rest pauses that are welcomed by 
the employees may lead to better job performance only because the employees 
feel that management is taking an interest in their welfare. Under such condi- 
tions, a rise in output cannot be attributed to rest pauses as such, since any 
other expression of interest might have served equally well. It is undoubtedly 
true that a judicious distribution of work and rest will improve working efficiency
for anyone, in any situation. But, in industrial studies, it is often difficult
to isolate the effect of this factor because of the operation of other uncontrollable
variables. 

Length of Work Week. Lengthening or shortening the workday and work 
week also affects productivity. A survey of about thirty-five hundred men and 
women workers in 34 industrial plants in the United States showed maximal 
efficiency with an 8-hour day and a 40-hour week (54). With longer hours, 
output rate declined and absenteeism and accident rate increased. 

During periods of national emergency, as in World Wars I and II, unusually 
long work weeks were instituted in some plants to meet critical shortages. It 
thus became possible to study the effect of major changes in work periods on 
job performance. A series of studies conducted during World War I by the 
Industrial Fatigue Research Board of Great Britain represents a particularly 
rich source of data on this question. The most striking finding of this research 
was that, beyond certain limits, total output was less the longer the work week. 
As this fact was recognized and the work week was again shortened, output 
rose. In one large munitions plant, for example, shortening the work week 
from 66 to 47.5 hours increased total weekly output by 13 per cent (87). 
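
The change in hourly output implied by the munitions-plant figures can be worked out directly. The sketch below is illustrative only: the function name is hypothetical, and the calculation treats the nominal 66 and 47.5 hours as if they were hours actually worked, which overstates the time really spent at work under the longer schedule.

```python
def hourly_output_ratio(old_hours, new_hours, weekly_output_change):
    """Ratio of new to old output per nominal hour.

    weekly_output_change is the fractional change in total weekly output,
    e.g. 0.13 for a 13 per cent increase.
    """
    return (1 + weekly_output_change) * old_hours / new_hours


# Work week cut from 66 to 47.5 hours; total weekly output up 13 per cent.
ratio = hourly_output_ratio(66, 47.5, 0.13)
print(f"Output per nominal hour rose by about {(ratio - 1) * 100:.0f} per cent")
```

On these figures, output per nominal hour rose by roughly 57 per cent, which indicates how much slower the pace of work must have been under the 66-hour schedule.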

Part of the explanation for these results is to be found in a comparison of 
the nominal with the actual number of hours worked. With longer work 
weeks, much more time is lost through absenteeism, tardiness, and unauthor- 
ized rest pauses. The rate of work is also slower during longer work periods; 
and it takes longer to reach the production peak at the beginning of the day. 
The tendency for workers to adjust their effort automatically to length of work 
period has been observed in laboratory experiments as well as in industrial investigations.
It is a natural tendency to conserve one's energy in various ways 
as one faces the prospect of a long period of activity and to increase one's energy
expenditure when a short period of activity is anticipated. 

There may be considerable lag in the effects of a shortened work week, just 
as there is after the introduction of rest pauses. In the previously cited study 
of munitions workers, two reductions in average number of working hours 
occurred, the first from 66 to 54.4 and the second from 54.4 to 47.5. On each 
occasion, no change in output was evident for a month. At that time, output 
began to improve gradually, but did not reach its maximal level until four or 
five months after the reduction of hours. In other similar follow-ups, the lag 
in effect varied from 3 weeks to over a year, being longer for the heavier 
work (88). 

It should not be assumed that the beneficial effects of shortening the work 
period continue indefinitely. Beyond a certain limit, further reduction may re- 
sult in wasted effort. Thus with shorter periods a greater proportion of the 
total working time is spent in warming up to peak daily production. Similarly, 
a larger proportion of time is needed for preparatory and terminal activities, 
such as obtaining supplies and adjusting equipment at the beginning of the 
day or cleaning up and putting away tools at the end of the day. Since these 
activities require the same amount of time regardless of length of work period, 
they occupy a greater proportion of time in the shorter work periods. The optimum
length undoubtedly varies with the nature of the work and with situational
and personal conditions. Little systematic research has been done on 
this question since the early studies of the Industrial Fatigue Research Board. 
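
The point about fixed preparatory and terminal activities can be illustrated numerically. The sketch below is a simple assumed model and is not drawn from any of the studies cited: the function name and the figure of 45 minutes of fixed daily overhead are hypothetical, chosen only to show how the proportion changes with the length of the workday.

```python
def overhead_share(day_hours, fixed_overhead_minutes):
    """Fraction of the workday taken up by activities of fixed duration
    (warming up, obtaining supplies, cleaning up)."""
    return fixed_overhead_minutes / (day_hours * 60)


# Hypothetical value: 45 minutes of fixed overhead per day, whatever the day's length.
for hours in (10, 8, 6, 4):
    share = overhead_share(hours, 45)
    print(f"{hours:>2}-hour day: {share * 100:.1f} per cent of working time is overhead")
```

Because the overhead is the same number of minutes in every case, its share of the day grows steadily as the day is shortened, which is the sense in which further reduction may result in wasted effort.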

Shift Work. The nature of many functions performed both in the military 
services and in modern industry has created an increasing need for round-the-clock,
seven-days-a-week operation. The demand for shift workers is thus 
steadily expanding. Shift work must be evaluated not only in terms of pro- 
ductivity but also in terms of its long-range effects upon the worker's health 
and well-being. The problem is twofold, involving, first, the relative effective- 
ness of day and night work and, second, the effect of adapting to periodic 
changes in work cycle. 

By following socially established schedules of work, recreation, meals, and 
sleep, man has developed a daily rhythm in physiological activities as well as 
in efficiency of work (see 52, 53). Body temperature, for example, reaches a 
low point between 2 and 5 A.M. and a high point in mid-afternoon. Similar 
cycles have been observed in blood pressure, pulse rate, chemical composition 
of the blood, activity of endocrine glands, and other physiological functions. 
With regard to behavioral indices, corresponding cycles have been found in 
laboratory studies using tests of sensory, motor, and intellectual functions, as 
well as in investigations of the output of industrial workers (see 12). It should 
be noted that by altering living schedules all these cycles can be shifted forward
or backward by several hours or completely reversed with regard to day 
and night. It is more difficult, however, to lengthen or shorten the basic 24-hour
cycle, at least in some physiological processes, even when subjects live 
under experimentally controlled conditions (52, 58). 

Vernon (88) surveyed the output of shift workers performing a wide variety 
of jobs in British munitions plants during World War I. Among workers on 
a rotating shift, with rotation periods ranging from a week to a month, average 
hourly output was approximately the same on day and night shifts. It is likely 
that the disruption of daily cycles resulting from changing shifts affected both 
the day and night performance of these workers. When continuous night 
work was compared with continuous day work in different groups, the night 
workers averaged 8 per cent lower in output than the day workers. 

In a survey of more recent investigations conducted in several countries, 
Bloom (12) notes a number of difficulties encountered by shift workers. With 
regard to night work as such, the most common problem is inability to get 
enough sleep in the daytime. In a military setting, this may be less serious 
than in a civilian community, because there is no conflict with the living habits 
of family and friends. In all situations, however, daytime noise, light, and 
heat tend to interfere with sleep. 

Even more disturbing, however, is adaptation to changing cycles. Studies of 
workers on rotating shifts report high frequency of errors, high accident rate, 
disturbances in eating and sleeping, and adverse effects on general health. In 
one survey, gastric ulcers were eight times as common among shift workers as 
among day workers (see 12). Because of these adaptive difficulties, Bloom 
proposes that fixed shifts may be preferable to rotating shifts. If rotation is un- 
avoidable, the periods between shift changes should be long enough to permit 
adaptation. Workers should also have several days off between changes. It has 
been noted, for instance, that a flight across the Atlantic, with its five-hour 
change in daily schedule, requires about a week for complete adjustment. 
More extensive changes, as from day to night shifts, may require longer ad- 
justment periods. Much more research is needed, however, to determine the 
effects of different shift systems and the course of adaptation to shift changes. 
Insofar as there seem to be individual differences in adaptability to change in 
daily cycles, selection of personnel for shift work should take such differences 
into account. 


TIME-AND-MOTION STUDY 


Contributions of Taylor and Gilbreth. In 1881 Frederick W. Taylor (21; 
67, pp. 67-113; 83), a mechanical engineer, began his observations of steelworkers
which led to the development of time study. His aim was to increase 
productivity by improving the performance of the workers. This approach 
represented a radical departure from engineering practices prevailing at the 
time, since it shifted attention from the machines to the worker. Even in the 
simplest unskilled operation, Taylor argued, the way in which the worker 
carries out the job can make a big difference in productivity. Time study was 
essentially a process of analyzing an operation into its component parts and 
noting the time required by the best workers to perform each part. Although 
Taylor defined time study to include the elimination of useless movements and 
the selection of the best way to perform each of the remaining movements (21, 
vol. 1, p. 227), little was done along these lines prior to the work of Gilbreth, 
to be discussed below. Taylor himself concentrated on establishing optimal 
times for different operations and setting pay rates in accordance with such 
times. 

Taylor was also a strong advocate of the selection and training of workers 
and of the introduction of rest pauses. In a widely quoted demonstration, he 
increased the output of a pig-iron handler from 12½ to 47½ tons per day. This 
increase was effected by instituting a schedule of rest pauses that required the 
worker to spend 58 per cent of the working day at rest (83, p. 60). The interpretation
of these findings, however, is complicated by the influence of motivational
factors, since the worker was also offered a higher rate of pay for 
meeting the prescribed production standard. 

Taylor's time-study procedures were incorporated into the more elaborate 
and refined techniques of motion study developed by Frank B. Gilbreth (39- 
41; 67, pp. 245-291). Like Taylor, Gilbreth was an engineer; his wife, who 
collaborated in much of his work, is a psychologist.1 Gilbreth's observations of 
many kinds of workers convinced him of the prevalence of wasted effort. 
When left to their own resources, he noted, most workers use many unneces- 
sary and inefficient movements. It was chiefly through the elimination of un- 
necessary movements that Gilbreth believed output could be increased and 
fatigue decreased at the same time. In one of his most famous investigations, 
he was able to reduce the number of separate movements required in brick- 
laying from 18 to 5. He thereby increased the average number of bricks that 
could be laid in an hour from 120 to 350. 

In motion study, movements are broken down into much smaller elements 
than was done in the earlier time study of Taylor. In mapping out the path of 
motion followed by the hand of the operator, Gilbreth utilized many special 
observational aids, such as photographs, moving pictures, stereoscopic slides, 
and three-dimensional wire models. The motion-picture camera is now standard
equipment in time-and-motion analysis. To assist further in the description 
and recording of industrial operations, Gilbreth devised a set of 17 units in 
terms of which any work cycle can be analyzed. These units are called "therbligs,"
which is “Gilbreth” spelled backward but with the "th" unreversed. 
Figure 46 lists the 17 therbligs, together with abbreviations and symbols employed
in rapid recording. A standard color code was also developed for use in 
the graphic representation of sequence and duration of each elementary movement.
Through an examination of the films and charts showing exactly what 
is done by each hand of the operator and the time spent in each activity, the 
time-and-motion analyst eliminates unnecessary movements and rearranges the 
remaining movements in what appears to be the quickest and easiest pattern. 

In working out an improved method for performing an industrial operation, 
the time-and-motion analyst is guided by certain commonly accepted rules, or 
“principles.” Lists of such principles can be found in standard contemporary 
works on time-and-motion analysis (e.g., 4, 11, 70). One of the most important 
rules is to minimize the number of motions by eliminating unnecessary movements.
Another is to minimize the length of motions, as illustrated by reducing 
the distance that the worker must reach in obtaining tools or materials or in 
operating machinery. Other common objectives are to utilize rhythmic motions
whenever feasible and to employ continuous, curved movements in preference
to straight-line movements involving sudden and sharp changes of direction.
Still another example is the principle of bimanual symmetry, providing 
for the use of both hands simultaneously in opposite and symmetrical movements.

1 This couple and their 12 children were the subject of a popular book and a motion 
picture entitled Cheaper by the Dozen (42). 


Therblig                      Abbreviation   Explanation of symbol                           Color
Search                                       Eye turned as if searching
Select                        St             Reaching for object                             Gray, light
Grasp                         G              Hand open for grasping
Transport empty               TE             Empty hand                                      Olive green
Transport loaded              TL             Hand with something                             Green
Hold                          H              Magnet holding iron bar                         Gold ochre
Release load                  RL             Dropping content out of hand
Position                                     Object being placed by hand
Pre-position                  PP             A nine-pin which is set up in a bowling alley
Inspect                                      Magnifying lens
Assemble                                     Several things put together
Disassemble                                  … removed
Use                                          Word "Use"                                      Purple
Unavoidable delay                            Man bumping his nose, unintentionally           Yellow ochre
Avoidable delay                              Man lying down on job voluntarily               Lemon
Plan                                         Man with his fingers at his brow thinking
Rest for overcoming fatigue                  Man seated as if resting

Fig. 46 The therbligs used in time-and-motion study. (Adapted from 
Barnes, 4, p. 117.) 


Critique of Traditional Time-and-Motion Study. When first introduced, 
time-and-motion study did not prove to be the boon that its enthusiastic exponents
had anticipated. As we view it in retrospect from the vantage point of 
industrial psychology, we can recognize many weaknesses in its original approach.
The atomistic treatment of movements overlooked the fact that the 
speed of a given movement is influenced by the other movements that precede 
and follow it. The identical element of motion may require appreciably more 
time when performed in one context than when performed in another (5, 
45, 91). A related objection is that a movement pattern that looks simple on 
a chart may be awkward and fatiguing to the human operator. Today more 
emphasis is put on the anatomic and behavioral characteristics of the human 
operator and less on the geometry of the path of motion. 

From the psychologist's viewpoint, a fundamental weakness in the approach 
of the early time-and-motion analysts was their failure to recognize individual 
differences in work methods. They were firmly convinced that there is "one 
best way" for performing each job. We now know that there may be an in- 
teraction between work methods and individual variables. As a result of dif- 
ferences in physical or psychological characteristics among workers, one 
method may be best for individual A and another method for individual B 
(2, 25). An interesting illustration is provided by research on the performance 
of older workers (92), showing that older persons tend to modify their method 
of performing a task so as to compensate for any decline in perceptual skills 
and other minor disabilities. When a task permitted a variety of approaches 
and the method was largely under the subject's control, compensatory changes 
occurred and performance showed no age decrement. In tasks whose perform- 
ance was rigidly set by the conditions of the experiment, on the other hand, 
performance declined with age. 

In time-and-motion analysis, the pattern of movement and the time for each 
part movement are generally derived from observation of a small number of 
workers. Usually the most successful performers are chosen for this purpose. It 
cannot be assumed that the method so derived will be the most effective for 
other individuals, who may differ in degrees of skill, experience, and other 
characteristics. 

A fourth criticism pertains to the criterion of efficiency used in traditional 
time-and-motion analysis, namely, speed of production. More attention needs 
to be given to other effects of work methods, such as energy expenditure and 
fatigue, errors and other qualitative aspects of output, accidents, and employee 
attitudes. 

An even more important factor is the workers’ misunderstanding and suspiciousness
regarding the objectives of time-and-motion study. Employees often 
fear that the result may be simply a speedup of production, to be followed by a 
drop in pay rate and the dismissal of some workers. The fact that time-and-motion
study did sometimes lead to these consequences in the past has done 
much to delay its acceptance by workers. In addition, there may be a general 
distrust of the “outsider,” the time-and-motion analyst who introduces the 
change. Here, too, it might be noted that an outsider can go wrong. Not being 
as close to the job as the workers themselves, he may overlook important points 
and make impracticable recommendations. 

The following passages are taken from a parody of what an outsider might 
do if he tried to increase the operating efficiency of a symphony orchestra. 
They are part of a report prepared by a mythical work-study engineer after 
attending a concert: 


For considerable periods the four oboe players had nothing to do. The number 
should be reduced and the work spread more evenly over the whole of the concert, 
thus eliminating peaks of activity. All the twelve violins were playing identical 
notes; this seems unnecessary duplication. The staff of this section should be 
drastically cut. If a larger volume of sound is required, it could be obtained by 
means of electronic apparatus. . . . 

There seems to be too much repetition of some musical passages. Scores should 
be drastically pruned. No useful purpose is served by repeating on the horns a 
passage which has already been handled by the strings. It is estimated that if all 
redundant passages were eliminated the whole concert time of two hours could be 
reduced to 20 minutes and there would be no need for an interval. . . . 

There were excessive reaches for some notes on the piano and it is probable that 
redesign of the keyboard to bring all notes within the normal working area would 
be of advantage to this operator. In many cases the operators were using one hand 
for holding the instrument, whereas the use of a fixture would have rendered the 
idle hand available for other work. It was noted that excessive effort was being used 
occasionally by the players of wind instruments, whereas one air compressor could 
supply adequate air for all instruments under more accurately controlled conditions (47). 


Finally, the principles of time-and-motion study need to be more fully 
checked in well-controlled research. Some are based only on “common sense” 
and are essentially hypotheses to be tested. Others are based on early studies 
with many uncontrolled variables. The advantage of using both hands simultaneously,
for example, has been questioned by the findings of at least two 
well-controlled studies. One investigation was concerned with a routine man- 
ual operation, in which the subjects had to pick up screw nuts from a bin and 
drop them into a receiver hole (6). For this task, a single-handed operation 
proved to be faster than a simultaneous bimanual operation. The other study 
compared a bricklaying method involving the use of only one hand at a time 
with Gilbreth’s method involving bimanual symmetry of movement (57). By 
means of a specially devised instrument, it was shown that Gilbreth’s bimanual 
method requires more physical effort on the part of the worker than does the 
new method that departs from this motion principle. These results should 
not be interpreted to mean that the principle of bimanual symmetry ought to 
be discarded. In most situations, it may still be effective to use both hands in 
symmetric movements. But the principle cannot be applied indiscriminately, 
without due regard for other concomitant circumstances. 


MODERN APPROACHES TO METHODS 
IMPROVEMENT 


Early misuses and abuses of time-and-motion study should not obscure the in- 
trinsic merits and potential contributions of this technique. Today time-and- 
motion study is used widely. Although its current applications do not always 
meet all the objections cited above, the more serious faults have been corrected. 

Worker Participation. In line with present recognition of the importance 
of employee attitudes, there is a growing tendency to have workers participate 
in any methods-improvement program. With good two-way communication, 
many suggestions for improving work methods will originate with the workers 
themselves. If workers become “motion-minded,” moreover, they can readily 
discover ways of eliminating waste motion and increasing operating efficiency. 
It has been proposed that workers and supervisors should be trained in the 
basic techniques of work simplification, so that they may themselves apply 
these procedures, with technical assistance and advice from experts when 
needed (see 59). Under these conditions, work methods can remain sufficiently 
flexible to allow for individual differences. No longer need the "one best way" 
be imposed upon all individuals by an outside expert. 

Broad Scope. A second characteristic of current methods-improvement 
procedures is their breadth. Time-and-motion study is only one of the tech- 
niques used. But work methods include more than movements. Attention 
must also be given to the improvement of the perceptual and intellectual as- 
pects of work processes (77). Any activity could probably be performed more 
efficiently if persons habitually engaged in it were to examine it systematically 
from this point of view. 

Methods-improvement programs merge imperceptibly with equipment design 
and layout, to be discussed in Chapter 9, as well as with ways of improving 
the working environment, to be discussed in Chapter 8. Some are concerned 
with work performed by a group of persons functioning as an integrated team. 
In this respect they border on some of the organizational and supervisory prob- 
lems considered in Chapter 6. 

The broader scope of the present approach is also evidenced in the utiliza- 
tion of more comprehensive criteria for evaluating efficiency of work methods. 
More use is now made of qualitative indices of output, such as errors, rejects, 
and spoilage, in addition to measures of sheer quantity. Safety has also become 
a major goal in the improvement of work methods. 

Basic Research. Still another feature of the present approach is to be 
found in the growing body of basic research on psychomotor performance. 
This is reflected in the very categories used to classify human movements. In 
contrast to the task-oriented therbligs of traditional time-and-motion study, 
increasing use is being made of categories that focus upon the nature of the 
motor responses themselves. Thus we now speak of positioning, repetitive, 
continuous, serial, and static responses—the last involving the use of muscular 
exertion to maintain a required position. From a still different angle, factor 
analysis has been used to identify basic motor skills, such as reaction time, 
manual dexterity, finger dexterity, and arm-hand steadiness.²

The close collaboration of psychologists and engineers in the newly emerg- 
ing field of engineering psychology has done much to stimulate such basic 
research. The interest of the military services in these problems has been an- 
other important contributing factor. A survey of much of this research can be 
found in books on engineering psychology (e.g., 62). Only three examples 
will be cited here. 

As one illustration, we may consider research on methods of carrying loads 
(see 84). Despite the development of mechanical devices, loads are still being 
carried by many kinds of workers, from construction laborers to restaurant 
waitresses—not to mention students who carry books! Experimental studies in 
this area have used several measures of energy cost to the organism, such 
as pulse rate and oxygen consumption. Output indices have also been employed,
as illustrated by time required to carry a standard load over a prescribed
course by different methods. By these procedures it has been established
that the optimal weight of load is about 35 per cent of the carrier's body
weight and the optimal rate is from 85 to 95 yards per minute. Faster rates
and heavier loads result in greater energy expenditure. The best position in
carrying loads is the one that interferes least with the carrier's normal balance
and center of gravity. This optimal position varies somewhat with the weight
of the load. High positions, as on a tray, are better for lighter loads, while low
positions are better for heavier loads. Similarly, the shape of the load may
affect the carrier's posture and hence the efficiency of operation.

2 For a brief summary of this factor-analytic research, as well as references, see
Anastasi (1, pp. 383-385).

The second example concerns the accuracy of blind positioning movements.
This is the type of movement required when a person operates some control
device without looking at it, usually because he must look at something else at
the time. When landing an airplane, for example, the pilot is unable to look
at the lever that lowers the flaps but must reach for it blindly. In an experiment
on this problem (26), the subjects wore blindfolds and were required
to touch various targets with a sharp-pointed marker, as shown in Figure 47.
There were three tiers of targets, the middle one being at approximately
shoulder level and the others at about 45° above and below. Horizontally, the
targets extended from front to 135° to the subject's right and left. The relative
accuracy of movement in the different directions is shown in Figure 48.
Mean error scores for each target position were found by assigning 0 to a
mark on the bull's-eye, the scores 1 to 5 to marks in successive concentric
circles, and a score of 6 to any mark falling outside the outermost circle. It
will be seen that accuracy is greatest for the front positions and least for the
extreme lateral positions. With regard to level, accuracy tends to be higher
for the lowest tier and lowest for the highest tier.

Fig. 47 View of left side of target stand used in study of blind
positioning movements. (From Fitts, 26; photograph by courtesy of
Aerospace Medical Research Laboratories, United States Air Force.)

Fig. 48 Accuracy of blind positioning movements in different directions.
Each circle represents the location of a target. Size of circle is proportional
to average error score, which is also given within circle. (Adapted
from Fitts, 26, p. 214.)
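
The scoring rule described for the blind-positioning targets can be sketched in
a few lines of Python. This is only an illustration of the arithmetic: the
function names, the use of None for a mark outside the outermost circle, and
the sample marks are all hypothetical and are not data from the experiment (26).

```python
from statistics import mean

def score_mark(ring):
    """Score one mark.

    ring is 0 for the bull's-eye, 1 to 5 for the successive concentric
    circles, and None (an assumed convention) for a mark that falls
    outside the outermost circle, which is scored 6.
    """
    if ring is None:
        return 6
    if not 0 <= ring <= 5:
        raise ValueError("ring must be 0-5, or None for a miss")
    return ring

def mean_error_by_target(marks):
    """Return the mean error score for each target position."""
    return {
        target: mean(score_mark(ring) for ring in rings)
        for target, rings in marks.items()
    }

# Hypothetical marks for two target positions (not the published data).
sample_marks = {
    "front, middle tier": [0, 1, 0, 2],
    "135 degrees right, top tier": [4, None, 5, 3],
}

scores = mean_error_by_target(sample_marks)
for target, value in scores.items():
    print(f"{target}: mean error {value:.2f}")
```

A larger mean error corresponds to a larger circle in a plot of the kind shown
in Figure 48.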

A final example is provided by a set of experiments dealing with serial 
movements. For this research, the investigators designed a type of apparatus 
which they called the Universal Motion Analyzer, illustrated in Figure 49. 
The principal feature of this apparatus is that different elements of the move- 
ment sequence can be separately timed by electric clocks that are started and 
stopped by the subject's own responses. Several different response panels were 
used in different experiments. 

The experiment illustrated in Figure 49 was designed to determine the 
effect of complexity of movement pattern on speed of movement. The panel 
provides 34 switches that can be turned on by the subject. In this experiment 
(73), only 5 were used in each of four movement patterns. The first pattern 
consisted of a single row of 5 switches, to be turned on successively. The 
second pattern required one change of direction, from horizontal to vertical; 
the third required two changes, and the fourth required three. Contrary to 
expectation, it was found that speed of movement was not affected by pattern 
complexity. Neither travel time from one switch to another nor manipulation 
time in turning on each switch was any longer when direction of the subject's 
hand movement changed than when it did not change. 

Of particular interest for time-and-motion study are two experiments in this 
series showing that the time required to perform the same movement varies 
with the context. In one of these experiments, manipulation time required to 
turn on switches was longer, the longer the distance traveled by the hand from 
one switch to another (91). In the other, the distance traveled remained con- 
stant, but the type of manipulation required to turn on the switches was 
systematically varied (45). The manipulations included, among others, pulling,
pushing, turning, and dial setting. The most noteworthy finding was that travel
time varied significantly as a function of manipulation. When the subject had
to perform a more time-consuming manipulation, as in setting dials, his hand
also moved more slowly from one position to the next than it did when he
performed quicker manipulations. It thus appears that, when the subject is set
for more precise and slower movement, this set carries over to all components
of the movement sequence. For this reason, it is necessary for the time-and-motion
analyst to check the times of the total movement sequence in its final
form, rather than relying on estimates obtained for separate components or for
these components observed in a different sequence.

3 Conducted under the direction of K. U. Smith, these experiments are reported in
a series of articles appearing in the Journal of Applied Psychology from 1951 to 1959.

Fig. 49 Universal Motion Analyzer used in experiments on effect of
complexity of movement pattern upon speed of movement. (From
Rubin, Von Trebra, and Smith, 73, p. 273.)


ACCIDENT PREVENTION


Considerable effort is now devoted to the problem of industrial safety. The
procedures employed in safety programs vary widely, from the development of 
safety devices and protective clothing to safety campaigns and competitions. 
One of the recognized objectives of methods improvement is to increase oper- 
ational safety. Studies of fatigue, rest pauses, length of work week, and shift 
work have frequently employed accident rate as a criterion. In fact, accident 
prevention is related to nearly every topic in industrial psychology, including 
selection, training, and employee attitudes. For several types of jobs, screening 
out individuals likely to have many accidents is an important aspect of personnel
selection. Training employees to follow safe operating procedures is a
significant part of many training programs. And the role of employee attitudes
in the observance of safety rules and in other behavior related to accidents is 
widely recognized. Characteristics of the working environment and equipment 
design have also been investigated with reference to safety. 

Psychology's contribution to industrial safety consists chiefly in research on 
the causes of accidents. Some typical approaches will be examined below. They 
have been chosen because they either illustrate important methodological prob- 
lems or concern some explanatory concept that has been used widely in acci- 
dent research. 

Accidents and Personal Characteristics. It has been estimated that
about 88 per cent of industrial accidents are caused by human factors, that is,
by something that a person does or fails to do (46, p. 21). Such causes include
not only unsafe operating procedures but also faulty or inadequate inspection
of equipment or materials. Thus even accidents attributed to equipment failure
may in many cases be traceable to human errors. It is therefore understandable 
that many investigators have looked for a relationship between accident rate 
and personal characteristics. A large number of variables have been studied 
in this connection, including intelligence, educational level, special job-related 
aptitudes, emotional and social traits, attitudes, sensory deficiencies, health and 
physical characteristics, job experience, age, sex, and socioeconomic back- 
ground factors. Although a few clear-cut relationships have been established, 
results are often difficult to interpret because so many of these variables are 
interrelated. 

Some of the methodological problems involved in isolating the contribution 
of different factors are illustrated in a well-designed study of the relation of age 
and experience to accident rate (86). A frequent difficulty in evaluating the 
relation of job experience to accident rate stems from selective turnover. Since 
workers with the worst accident records tend to be separated from the com- 
pany, the longer an employee remains on a job the more highly selected is he 
likely to be with regard to safety. This progressive selection could bring about 
a spurious negative relation between length of job experience and accident 
rate. In the above study, monthly accident records were plotted separately for 
1,237 employees who remained with the company during a five-year period 
and for 1,317 employees who were hired at the same time but who had left or 
been discharged during that period. Figure 50 shows the results. It can be seen 
that accident rate drops sharply for both groups during the first five months on 
the job. Beyond that point, however, there is a general leveling off in both 
groups. Of particular interest is the finding that, after the initial adjustment 
period, accident rate is consistently higher for the turnover than for the non- 
turnover group. 

In the same study, accident records were similarly plotted for 387 workers 
hired after a systematic training program had been installed in the company.
These men thus had the benefit of formal training in correct job procedures 
and safety methods. The result was a lower initial accident rate and a more 
rapid drop than in the untrained group. The trained group required only three 
months of job experience to reach the safety level achieved by the untrained 
group after five months. 

In still another part of the same study, age was found to be significantly
associated with accident rate, when the effect of job experience is ruled out.
Figure 51 gives the average monthly accident rates for three groups of workers.
Groups A and B were equated in job experience, each having spent an
average of three years on their present job. Group A, however, was younger
than Group B, their mean ages being 29 and 41 years. It can be seen that the
younger group had a consistently higher accident rate throughout the experimental
period. Group C was in the same general age bracket as Group B
(mean age 39 years) but unlike the first two groups had no job experience. It
is interesting to note that after the usual adjustment period this group's accident
rate dropped to a level that practically coincided with that of Group B.

Fig. 50 Relationship between job experience and average monthly
accident rate for a non-turnover and a turnover group. (From Van
Zelst, 86, p. 315.)

In studies of the relation between frequency of traffic accidents and personal
characteristics of drivers, it is necessary to control mileage and exposure 
hazard. Thus, if drivers in certain age ranges drive more miles per year than 
those in other age brackets, or if they drive under more dangerous conditions 
with regard to weather, illumination, or traffic volume, they are more likely to 
have accidents. Yet only a few studies of the characteristics of “safe” and 
"unsafe" drivers have adequately controlled these factors. The sort of pro- 
cedure needed for this purpose is illustrated by an investigation of 482 street- 
car operators in a large Middle Western transit company (55). Individual ac- 
cident expectancy was computed in terms of vehicle mileage operated and 
variations in accident frequency among routes, seasons, days, and hours. An 
accident index was then found for each operator by dividing the actual number 
of accidents over a three-year period by the expected number. Comparison of 
this index with each of 76 medical and personal items yielded low correlations, 
of which only about a third reached statistical significance. Although many of
these characteristics may have contributed to accident causation in individual
cases, none exhibited a sufficiently close relation to accident rate to permit
adequate prediction.

Fig. 51 Relationship between age of employee and average monthly
accident rate. (From Van Zelst, 86, p. 316.)
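
The accident index used in the streetcar study, actual accidents divided by
expected accidents, can be sketched as follows. The source does not give the
formula for accident expectancy, so the version below is an assumption: it sums
mileage multiplied by an accident rate for each operating condition. All names,
rates, and mileages are hypothetical.

```python
def expected_accidents(exposure, rates):
    """Expected accidents for one operator.

    exposure maps each operating condition (e.g., a route and time of day)
    to thousands of miles operated under it; rates maps the same conditions
    to accidents per thousand miles for the whole company. This weighting
    scheme is an assumed stand-in for the study's expectancy computation.
    """
    return sum(miles * rates[condition] for condition, miles in exposure.items())

def accident_index(actual, expected):
    """Ratio of actual to expected accidents; values above 1 mean more
    accidents than the operator's exposure alone would predict."""
    if expected <= 0:
        raise ValueError("expected accidents must be positive")
    return actual / expected

# Hypothetical company-wide rates (accidents per thousand miles).
company_rates = {"route A, rush hour": 0.4, "route B, off-peak": 0.1}

# Hypothetical operator: thousands of miles under each condition.
operator_exposure = {"route A, rush hour": 5.0, "route B, off-peak": 10.0}

expected = expected_accidents(operator_exposure, company_rates)
index = accident_index(actual=6, expected=expected)
print(f"expected {expected:.1f}, index {index:.2f}")
```

Because the index is adjusted for exposure, two operators with the same raw
accident count can receive quite different indices if one drives more miles or
works more hazardous routes and hours.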

Surveys of automobile drivers have generally failed to reveal any close 
association between personal characteristics and accident frequency, although 
measures of attitudes and personality factors appear somewhat more promising 
than other variables investigated (13, 15, 17, 18, 44, 82). Even in the case of 
personality characteristics, however, it seems likely that different response 
patterns account for accidents on the part of different individuals. For example, 
some may have accidents because of habitual unconcern with the rights of 
others, some because of a manic tendency to hurry at all times, some because 
of strong feelings of hostility and competitiveness toward other drivers, and so 
on (17). It is also interesting to note that two studies of young automobile 
drivers found no significant relation between accident frequency and either 
habitual driving speed or number of traffic violations (20, 81). All these results
suggest that the type of behavior that produces accidents may be quite
specific and cannot be identified in terms of broad personality traits or general
driving habits.

Accident-proneness. The interpretation of accident statistics is beset with 
pitfalls. An example is provided by the concept of “accident-proneness,” which 
has been defined as the tendency of an individual to retain his relative accident 
liability over a long period of time. Often this concept has been used with the 
further implications of generality and causality. Thus it is assumed that a
highly accident-prone person will tend to incur many accidents in all situations,
from opening a can to driving a car, and that his accident liability is a reflection
of personal characteristics leading to unsafe behavior. The distribution of
accidents within a group has often been cited as evidence of accident-proneness.
For example, if 10 per cent of the group have 50 per cent of the accidents,
these individuals are identified as accident-prone. It would then seem
that, if we could either remove these persons from hazardous jobs or possibly 
retrain them, we should be able to cut the accident rate in half. 

The fallacy in this reasoning stems from the assumption that by chance 
everyone should have the same number of accidents. Hence any marked devi- 
ation from the average accident rate is regarded as accident-proneness. Actually 
the distribution of accidents obtained in most groups closely approximates the 
distribution expected by chance (69). With rare events, such as serious accidents,
this distribution is highly skewed, since most persons will have no accidents
during the period covered.⁴ Suppose a total of 10 accidents occur in a
group of 100 persons. Obviously at least 90 persons will have had no accident.
In fact, since some will have two or even more accidents by chance, more than
90 will have had no accident. As the total number of accidents in the group
increases, the distribution becomes less skewed and approaches a symmetrical
curve. This is illustrated in Figure 52, which shows the distributions of accidents
among 59 streetcar motormen over increasingly longer periods, ranging
from 1 to 13 months (38, pp. 343-344). The longer the observation period, 
the larger the total number of accidents and hence the more nearly symmetri- 
cal the curve. 
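
The chance distribution in the example of 10 accidents among 100 persons can be
worked out with the Poisson formula. The sketch below is illustrative only: it
treats the mean of 0.1 accident per person as the Poisson parameter, which is
an approximation because the total is not held fixed at exactly 10, and the
function names are hypothetical.

```python
import math

def poisson_pmf(k, lam):
    """Probability of exactly k accidents when the mean per person is lam."""
    return math.exp(-lam) * lam ** k / math.factorial(k)

def expected_persons(n_persons, n_accidents, max_k=3):
    """Expected number of persons with 0, 1, ..., max_k accidents by chance."""
    lam = n_accidents / n_persons
    return [n_persons * poisson_pmf(k, lam) for k in range(max_k + 1)]

def poisson_skewness(lam):
    """Skewness of a Poisson distribution, which equals 1 / sqrt(lam)."""
    return 1 / math.sqrt(lam)

counts = expected_persons(n_persons=100, n_accidents=10)
for k, persons in enumerate(counts):
    print(f"{k} accidents: about {persons:.2f} persons")

# Skewness shrinks as the mean number of accidents per person grows,
# as happens when the observation period is lengthened.
for lam in (0.1, 1.0, 4.0):
    print(f"mean {lam}: skewness {poisson_skewness(lam):.2f}")
```

Under these assumptions about 90.5 persons are expected to have no accident,
slightly more than 90, and a few are expected to have two or more purely by
chance.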

The only sure test of accident-proneness is to compare the accident records 
of the same individuals for two periods of time (66). In this way we can 
ascertain whether the same individuals have high accident records during both 
observation periods. The correlations that have been found between accident 
records for two periods vary widely, averaging about .40 (38, p. 344; 51; 69).
When viewed as a reliability coefficient, such a correlation does not appear
very high. It suggests that an individual's past accident record, taken by itself,
is a poor predictor of his subsequent accident liability. It should be added that
even lower correlations are found when the frequencies of different kinds of
accidents are correlated. Moreover, the correlations obtained undoubtedly reflect
in part stable environmental conditions, which contribute to the consistency
of individual accident records. For example, individuals performing
relatively hazardous jobs will be doing so during both observation periods and
will thus tend to have higher accident rates in both periods than will workers
in safer jobs.

4 When the number of events (e.g., accidents) is quite small compared with the number
of persons to whom such events can happen, the appropriate chance distribution is
the Poisson distribution (see 66, 68, 69).

Fig. 52 Distribution of accidents among 59 streetcar motormen for
various periods of time. (Adapted from Ghiselli and Brown, 38, p. 343.)

A striking demonstration of the inadequacy of accident-proneness to account 
for traffic accidents is provided by a re-analysis of the accident records of about 
29,500 Connecticut drivers over a six-year period (27). The original survey in 
which these data were gathered showed that 3.9 per cent of the drivers accounted
for 36.4 per cent of the accidents during the six-year period (23, p. 196; 85).
This finding was widely quoted in support of the concept of accident-proneness.
Re-analysis of the same data with regard to the first and second
three-year intervals, however, revealed that the high-accident records during 
the two periods were obtained largely by different individuals. As a result, if 
those drivers having two or more accidents during the first three-year period 
had been prevented from driving during the second period, the total number 
of accidents during the second period would have been reduced by only 3.7 per 
cent. To put it differently, those identified as "safe drivers" during the first 
three years (because they had either one accident or none) accounted for 96.3 
per cent of all the accidents during the next three years. When properly in- 
terpreted, the findings on accident-proneness suggest that lasting personal char- 
acteristics play a relatively minor part in the overall occurrence of accidents 
and that situational factors play a relatively large part. 

Accidents and Psychological Climate. Among the situational variables 
investigated in relation to accident rate are various working conditions sub- 
sumed under the concept of “psychological climate” (50, 51). One group of 
investigators (50) analyzed all lost-time accidents of 7,103 employees of a 
tractor factory over a five-year period. Mean accident rate per person was com- 
puted for each of 44 shop departments. The same departments were also rated 
on a number of situational variables by a panel of eight raters from the com- 
pany's managerial personnel. The mean accident rate for departments was 
then correlated with each of these rated situational variables. Partial correlation 
technique was employed in the effort to tease out the effects of variables that 
were themselves intercorrelated. 
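
As an illustration of what partial correlation does, the standard first-order
formula removes the influence of one third variable from the correlation
between two others. The source does not describe the study's actual computation,
so the sketch below is an assumption about the general technique only; the
variable labels and correlation values are hypothetical.

```python
import math

def partial_correlation(r_xy, r_xz, r_yz):
    """First-order partial correlation of x and y with z held constant.

    r_xy, r_xz, and r_yz are the ordinary (zero-order) correlations among
    the three variables.
    """
    denominator = math.sqrt((1 - r_xz ** 2) * (1 - r_yz ** 2))
    if denominator == 0:
        raise ValueError("partial correlation undefined when |r_xz| or |r_yz| is 1")
    return (r_xy - r_xz * r_yz) / denominator

# Hypothetical values: x = accident rate, y = amount of crew work,
# z = physical discomfort of the shop environment.
r = partial_correlation(r_xy=0.5, r_xz=0.6, r_yz=0.5)
print(f"partial correlation: {r:.3f}")

# When the x-y correlation is fully accounted for by z, the partial
# correlation drops to zero.
r_zero = partial_correlation(r_xy=0.3, r_xz=0.6, r_yz=0.5)
print(f"partial correlation: {r_zero:.3f}")
```

A partial correlation well below the zero-order value suggests that much of the
apparent relation between two rated variables arises from their common
association with the third.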

One of the closest relationships found in this study was that between acci- 
dent rate and physical discomfort of the shop environment, as from excessive 
heat, noise, vibration, or dust. The investigators concluded that the annoyance 
and distraction occasioned by these conditions increase accident liability. It 
was also interesting to note that the presence of an obvious danger, such as 
glowing molten metal, increased the likelihood of unrelated accidents that did 
not involve the particular hazard. It may be that the presence of an obvious 
danger reduces the attention given to other, less conspicuous accident hazards. 
Another finding of special interest was that amount of crew work was positively 
correlated with accident rate. Crew work calls for coordination and synchronization
of the actions of crew members; it also requires more vigilance in
accepting and relinquishing responsibility in the performance of a group task
than is needed in individual work. These conditions may increase the accident
risk in crew work.

In a survey of published findings pertaining to psychological climate and 
accident rate, Kerr (51) proposes two major explanatory mechanisms, based 
upon alertness and stress. First he argues that accidents are simply one kind of 
low-quality work behavior, similar to errors or spoilage. Conditions that im- 
prove the quality of work tend also to reduce accident rate. One of these is 
alertness; another is freedom from distracting negative stresses. With regard to 
alertness, Kerr maintains that a psychological climate that encourages passive 
compliance and discourages individual initiative is likely to reduce alertness 
and thereby increase accident frequency. One survey of different factory departments,
for instance, found a correlation of -.40 between promotion possibilities
and accident rate. In the same survey, departments with the best suggestion
records tended to have the fewest accidents.

Among stress factors, Kerr includes internal as well as external conditions.
The former can be illustrated by disease, alcohol, and worry; the latter, by temperature
excesses, poor illumination, and disturbing noise level. Unlike the
constitutional factors implicit in some accident-proneness theories, such stress 
factors need not be permanent. It is apparent that psychological-climate 
theories of accidents link the problem of industrial safety with questions of 
organizational structure and supervisory practices discussed in Chapter 6. 

Task Variables. With the growing realization that accident-proneness and 
personal characteristics account for relatively few accidents, attention has 
turned more and more to the contribution of task variables. Any conditions 
that increase the complexity or difficulty of the operator's task will lengthen 
response time and raise the likelihood of error (28). When a driver becomes 
confused because of inadequate road marking, for example, he is likely to slow 
down or stop abruptly, make an improper turn, get on a one-way lane in the 
wrong direction, or engage in other unsafe and hazardous behavior.

Psychologists first approached the problem of traffic accidents by developing 
tests for the selection of drivers (23, 56). A later development was the establishment
of “drivers’ clinics” for rehabilitating accident-prone drivers by making
them aware of deficiencies and of faulty operating habits (23, 49, 56).
More recently, interest has shifted to the application of human engineering to 
the design of highways and equipment (28, 29, 31). For example, research 
has been conducted on the dimensions and placement of controls, displays, 
seats, and windows in buses and trucks (63). A number of studies have been 
concerned with characteristics of highway signs, such as position, size, color, 
and lettering (33; 35; 62, pp. 113-114). Others have dealt with the relative
perceptibility of stop signs and blinker lights (48), the interpretability of
traffic symbols (14, 32), and driver reactions to various highway markings and
illumination systems at a cloverleaf interchange (24).

Drivers’ responses to the amber phase of traffic signals (71) and drivers’
judgments of the speeds of their own and of other cars on the highway (3,
72) have also been investigated. It has been noted, for example, that on superhighways
the absence of reference objects on the roadside makes it more
difficult to estimate the speed of other cars (28). Research has also been conducted
on the conditions affecting the rate of flow of both highway and tunnel
traffic (30, 36). These conditions may produce accident hazards as well as
traffic stoppages and delays.

Much of the research on traffic problems cited in this section deals with the
designing of equipment and with man-machine systems, to be discussed in 
Chapter 9. Other studies have investigated the effects of fatigue, loss of sleep, 
and alcohol on driving behavior (see 23, 29, 34, 56, 60). These conditions 
create serious accident hazards by reducing vigilance, alertness, sensorimotor 
coordination, and speed of response. It is apparent that the study of accidents 
has been approached from so many different angles that it cuts across several 
fields of applied psychology. 

Near Accidents. In many situations, the accumulation of data on a suffi- 
cient number of accidents to permit adequate statistical analysis takes a long 
time. To obtain enough cases, the investigator may have to combine different 
types of accidents or accidents occurring under different conditions, thereby 
increasing the difficulty of interpreting results. Partly for these reasons, some 
investigators have turned to the study of near accidents, or “close calls” (see
19, ch. 3). Another advantage of studying near accidents is that people are
usually less reluctant to report near accidents than full-fledged accidents in
which they were involved. Since the potentially dangerous situations leading
to near accidents are often similar to those producing actual damage, near 
accidents provide a fruitful field of research. 

Systematic studies of near accidents have been conducted with Air Force 
flight personnel and with long-haul truck drivers, among others (19, ch. 3). 
Data can be gathered in various ways, as by having an observer ride in the 
truck with the driver, by having individuals record their own near accidents 
as they occur, or by means of a critical-incident technique. In one Air Force 
study, for example, a large number of pilots were asked whether they had 
ever made or seen anyone else make “an error in reading or interpreting an 
aircraft instrument, detecting a signal, or understanding instructions" (19, p. 
89). If they had, they were requested to describe the incident. Tabulation of 
replies revealed that the two most common errors were reversal errors (e.g., in
interpreting direction from compasses) and errors in interpreting multirevolu- 
tion instruments (e.g., in reading an instrument having more than one pointer 
or one with a pointer and a rotating dial). Such findings helped to pinpoint 
potential sources of danger and led to research on the redesign of these instru- 
ments. 

In view of the many difficulties encountered in the analysis of accident 
records, the investigation of near accidents represents a distinct methodological 
improvement. More general research on driver behavior and driving skills is 
also relevant to the problem of accident prevention. Such research can be con- 
ducted under realistic driving conditions in the field or through the use of 
simulators in the laboratory. This positive approach holds more promise than 
that of waiting for accidents (or near accidents) to happen. 


SUMMARY 


Engineering psychology uses research on human behavior in the design of 
work methods, working environment, and equipment. Minimizing fatigue is 
an important goal of human efficiency. Fatigue has been variously defined in
terms of feelings of tiredness, changes in output, and physiological conditions. 
Studies with the respiration calorimeter and with the heart-rate recovery curve
illustrate the physiological concept of fatigue. Vigilance tests provide a promising
behavioral index of fatigue. Feelings of monotony and boredom are difficult
to differentiate from fatigue, and especially from so-called mental fatigue.
Monotony depends upon the relationship of task, surroundings, and worker. 
Among the proposed remedies for industrial monotony are personnel selection 
and placement, changes in the working environment, rest pauses, broad work 
units, orientation regarding product and total production process, job rotation, 
and job enlargement. 

Human efficiency can be increased through the effective distribution of 
working time. Properly scheduled rest pauses generally increase total output, 
although the effects may not be immediately apparent. Lengthening the work 
week beyond an optimal number of hours has usually lowered rather than 
raised total output. Shift work involves a disruption of daily physiological 
rhythms, especially for workers on rotating shifts. Individual differences in 
adjusting to changing activity cycles have been noted. 

Beginning with the early contributions of Taylor and Gilbreth, time-and- 
motion study has now become a part of more comprehensive methods-improve- 
ment programs. The latter are characterized by an increasing emphasis upon 
worker participation, broadened scope, and the accumulation of basic research 
on principles of human performance. 

Accident prevention is a recognized objective of modern methods improve- 
ment, besides being related to nearly every other aspect of industrial psychol- 
ogy. Psychologists' contributions to this problem consist largely in research on 
the causes of accidents. Studies on the relation of accident rate to personal 
characteristics must take into account the interrelation of many personal vari- 
ables. Accident-proneness can be conclusively demonstrated only through the 
correlation of individual accident records over different time periods. Studies 
of psychological climate have shown accident rate to be significantly related 
to several situational variables. With the recognition of the importance of task 
variables as causes of traffic accidents, the methods of engineering psychology 
are being used increasingly to improve the layout, illumination, and marking 
of highways. Research on near accidents, as well as the direct study of driver 
behavior and driving skills, provides a promising technique for investigating 
potential sources of danger in operating procedures or in equipment. 
The Working Environment 


It has long been recognized that working efficiency depends in part upon 
the characteristics of the working environment. Engineers have been concerned 
with the establishment of illumination standards for different types of work, 
the maintenance of suitable temperature and other atmospheric conditions, 
the reduction of noise, and similar problems. Considerable interest has been 
aroused, too, by the possible uses of music to relieve monotony, improve em- 
ployee morale, and facilitate certain kinds of work. It is apparent that such 
problems pertain not only to factories and offices but also to schools, libraries, 
homes, waiting rooms, and any other environments designed for prolonged hu- 
man use. From a different angle, recent developments in space travel have 
stimulated intensive research on the effects of certain extreme environmental 
conditions upon human performance. 

The principal contributions of psychologists to all these problems have been 
methodological. Research in this field has been advanced by the introduction 
of experimental designs that control for such factors as suggestion and attitude 
changes. The Hawthorne Studies, one of which was originally concerned with 
the effects of illumination, focused attention on the need to control these more 
subtle variables. Psychology has also stimulated the investigation of individual 
differences in the effects of environmental variables. It is now recognized that 
in the working environment, as in work methods, certain individual 
characteristics must be considered in setting optimum standards. 


ILLUMINATION 


Amount of Light. To determine optimal illumination levels for different 
types of work, we need some criterion of visual efficiency. Shall we use meas- 
ures of output, subjective reports of comfort, or some physiological index of 
energy expenditure and fatigue? The use of different criteria accounts in part 
for the discrepancies in the results obtained by different investigators and for 
a controversy that began in the 1930s and is not yet completely resolved. In 
general, the research of lighting engineers led to higher recommended levels 
of illumination than did the research of psychologists. The former is 
represented chiefly by the work of Luckiesh and his associates (45), the latter 
by that of Tinker. 

In a series of studies, Luckiesh and his co-workers compared the effectiveness 
of different illumination levels in terms of such criteria as visual acuity, blink 
rate, heart rate, muscular tension, and expressed preferences (see 45; 47, ch. 
3). Tinker (74, 75) and others have criticized these studies on several grounds. 
First, the Luckiesh experiments employed too few illumination levels to permit 
a precise determination of optimal amounts of light. In most of the studies, 
only 1, 10, and 100 footcandles¹ were investigated, and in some only 1 and 
100. As Tinker points out, the improvement observed in these studies from 
10 ft-c to 100 ft-c was generally slight; investigation of intermediate intensities 
might have revealed a leveling off in significant change not far above 10 ft-c. 

The specific criteria of visual efficiency employed in these studies have also 
been called into question. Visual acuity is undoubtedly the most satisfactory 
among them, since it is most closely related to the visual functions to be per- 
formed. Nevertheless, it should be noted that higher illumination is needed to 
make fine discriminations close to the acuity threshold than would be required 
for the suprathreshold discriminations involved in most visual tasks. Hence 
visual acuity is likely to show continued improvement with increasing illu- 
mination beyond the point where the performance of many ordinary visual 
tasks would exhibit appreciable gains (75, p. 438). 

The results obtained with heart rate are difficult to interpret because of some 
ambiguity regarding the implications of a drop in heart rate. Luckiesh found 
that during continuous reading heart rate declined, the decline being greater 
under 1 ft-c than under 100 ft-c. From this he concluded that reading under 
the dimmer light was more difficult for the subjects. It could be argued with 
greater justification, however, that the greater drop (obtained under 1 ft-c) is 
the more desirable physiological condition and indicates less expenditure of 
effort (4). 

Blink rate and muscular tension have proved equally unsatisfactory as 
criteria. Studies of the frequency of involuntary blinking while the subject is 
performing a visual task under different levels of illumination have yielded 
negative or conflicting results (see 75). Measures of muscle-action potentials in 
various body parts not directly involved in task performance have likewise re- 
vealed no consistent relation to intensity or other qualities of illumination (66). 

In some investigations, these negative findings may result from the subject's 
tendency to reduce output rate and decrease the work-rest ratio under more 
difficult working conditions, as when illumination is poorer. There is some 
evidence to suggest that subjects do in fact adjust their work-rest ratio to the 
difficulty level of the visual task performed (85, p. 133). When work rate is 
controlled by the experimenter, the subjects may make more errors when work- 
ing under more stressful conditions. Measures of effort or energy expenditure 
unaccompanied by measures of output may thus provide an inadequate pic- 
ture of the effects of working conditions. 

On the other hand, measures of output taken by themselves may also be 
misleading, because of the opposite tendency to maintain a "normal" pace of 
work and adjust effort accordingly. This tendency is most likely to affect re- 
sults when observation periods are short. It is apparent that the factors influ- 
encing both effort and output are complex, and their contribution varies with 
the specific circumstances. Reliance upon any single criterion of visual 
efficiency may thus be misleading, especially when such a criterion is applied 
without due consideration of situational factors. 

1 A footcandle (ft-c) is the amount of light from a 1-candlepower source falling on a 
surface 1 foot away. For a simple explanation of the units employed in the 
measurement of light, see McCormick (47, ch. 2). 

In view of all these difficulties encountered in applying objective criteria 
of visual efficiency, it might seem that subjective expressions of preference 
for different illumination levels might be the best index after all. 
Unfortunately the solution is not that simple. In a well-designed series of experiments, 
Tinker (77) demonstrated that subjects’ choices of illumination levels depend 
to a large extent upon visual adaptation. In one of these experiments, 53 stu- 
dents read individually under direct light for six successive 5-minute periods. 
The reading was preceded by 20 minutes of adaptation to a general illumination 
of 4 ft-c. This general illumination was turned off when the local light 
was turned on. Before starting to read, as well as after every 5 minutes of 
reading, the subject was asked to adjust the local light to his preferred level. 
In one series, the subjects began with 1 ft-c and increased the intensity to the 
chosen level; in the other, they began with 50 ft-c and decreased the intensity 
to the desired level. 

The effect of the starting level was revealed both in the initial choice made 
during the first 5-minute period and in the mean level chosen throughout the 
series. When the subjects started with 1 ft-c, their mean initial choice was 
14.4; when they started with 50 ft-c, it was 29.2. The overall preference 
averaged 21.5 in the former series and 28.9 in the latter. 

In another experiment, Tinker (77) controlled visual adaptation more 
thoroughly. A total of 144 students participated. Working under diffuse, in- 
direct lighting, each subject read for two 50-minute sessions, following an 
initial 15-minute adaptation period. At one session, the subjects were adapted 
to 8 ft-c; at the other, to 52 ft-c. Half of the subjects began with the 8 ft-c 
session, the other half with the 52 ft-c session. During the 50-minute reading 
period, the same light intensity (8 or 52 ft-c) was used as a standard with 
which the subject was asked to compare lights ranging from 1 to 26 ft-c in 
the one series and from 18 to 100 ft-c in the other. Before each comparison, 
the subjects were readapted to the standard light for 4 minutes. In each series, 
the comparison lights were presented in random order. 


TABLE 13 Effect of Visual Adaptation upon Intensity of Light 
Preferred for Reading under Diffuse General Illuminationᵃ 
(From Tinker, 77, p. 63) 

Adapted to 8 ft-c (Standard) 
Footcandles                     1         2         3         5         8         12        18        26        41 
Percentage of times preferred   0.0       [illeg.]  1.5       [illeg.]  [illeg.]  [illeg.]  [illeg.]  3.9       [illeg.] 

Adapted to 52 ft-c (Standard) 
Footcandles                     18        30        [illeg.]  [illeg.]  [illeg.]  [illeg.]  [illeg.]  [illeg.]  100 
Percentage of times preferred   5.4       8.0       11.4      14.0      21.0      11.6      11.9      9.[illeg.] 7.6 

ᵃ N = 144; total number of judgments = 1,152. The number of preferences for the 
standard lights was divided by 8 before computing percentages, in order to adjust 
for the reappearance of the standard in each of the 8 comparisons. 
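
The adjustment described in the note to Table 13 can be sketched in a few lines of Python. The tallies below are hypothetical and are not Tinker's data; the function name and its arguments are likewise illustrative assumptions. The point is only that the standard light appeared in every one of the 8 comparisons, so its raw tally must be divided by 8 before it can be compared with lights that appeared once.

```python
def preference_percentages(counts, standard, n_comparisons=8):
    """Convert raw preference tallies to percentages.

    counts        -- dict mapping footcandle level to number of times preferred
    standard      -- the level that reappeared in every comparison
    n_comparisons -- how many comparisons included the standard (8 in Table 13)
    """
    adjusted = dict(counts)
    # Put the standard on the same footing as lights shown only once.
    adjusted[standard] = counts[standard] / n_comparisons
    total = sum(adjusted.values())
    return {level: 100 * n / total for level, n in adjusted.items()}


# Hypothetical tallies, for illustration only (not Tinker's data).
hypothetical = {5: 30, 8: 400, 12: 20}
percentages = preference_percentages(hypothetical, standard=8)
print(percentages)
```

Without the division, the standard in this made-up example would appear to account for nearly 90 per cent of all choices simply because it was offered eight times as often.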


The results are summarized in Table 13. It will be seen that in both series 
the subjects chose the light to which they had been adapted more often than 
the other intensity levels. In the 8 ft-c series, the median preference fell 
roughly at 12 ft-c; in the 52 ft-c series, it fell approximately at 52 ft-c. It 
is also noteworthy that individual differences in these preferences were very 
large. With the exception of 1 ft-c, every given level of illumination was 
chosen by some subjects. 
The principal implication of these experiments, however, is that expressed 
preferences based on short observation periods are highly susceptible to prior 
visual adaptation. 

Several laboratory investigations have utilized output criteria for various 
tasks performed under different levels of illumination. In a typical study, 27 
male college students took the Purdue Hand Precision Test under 5, 50, and 
150 ft-c illumination (48). This test was chosen because it calls for a 
combination of visual and motor skills required in a number of industrial jobs. The 
subject must insert a stylus successively into three holes that are uncovered 
by a rotating shutter. Each hole is ½ inch in diameter. An error is 
automatically recorded whenever the stylus touches the side of a hole, the disk in which 
the holes are located, or the rotating shutter. In this experiment, the three 
illumination levels were presented to different subjects in a counterbalanced 
order so as to control the effects of practice, fatigue, motivation, and other 
cumulative factors. As can be seen in Figure 53, the average error scores de- 
creased with increasing illumination. The differences between the mean 
scores at 5 ft-c and those at the other two levels were significant; the difference 
between mean scores at 50 ft-c and at 150 ft-c was not. 
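
One simple way to counterbalance three conditions is to rotate their order from subject to subject, so that each illumination level falls equally often in the first, second, and third position. The source does not say which scheme was actually used in this study; the rotation below is an assumption chosen for illustration, and the function names are hypothetical.

```python
def rotated_orders(levels):
    """Return the cyclic rotations of `levels`.

    Across the rotations, each level occupies each serial position once.
    """
    return [levels[i:] + levels[:i] for i in range(len(levels))]


def assign_orders(n_subjects, levels):
    """Give subject k the (k mod n)-th rotation of the levels."""
    orders = rotated_orders(levels)
    return [orders[k % len(orders)] for k in range(n_subjects)]


levels = [5, 50, 150]                 # footcandles used in the study
schedule = assign_orders(27, levels)  # 27 subjects, as in the study

# Tally how often each level falls in each serial position.
for position in range(len(levels)):
    tally = {lv: sum(order[position] == lv for order in schedule) for lv in levels}
    print("position", position + 1, tally)
```

With 27 subjects and three rotations, each level is met 9 times in each position, so practice and fatigue effects are spread evenly over the three illumination levels.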

Tinker (74-76) has summarized the findings of similar investigations con- 
ducted by himself and others with a variety of tasks, such as reading differ- 
ent sizes of print. He reports the results in terms of critical levels of illumina- 
tion, which he defines as the intensity above which there is no appreciable 
improvement in performance. To these critical levels he adds a certain "margin 
of safety" to provide for individual differences and other unpredictable 
factors. Thus he arrives at recommended levels of illumination for different 
functions. For reading good-sized print (10- to 11-point), for instance, 10 to 
15 ft-c is considered adequate.² For sorting mail and other tasks requiring the 
reading of handwriting, 20 to 30 ft-c is recommended. For the most demanding 
visual tasks encountered in work situations, such as bookkeeping and drafting, 
40 to 50 ft-c is regarded as desirable. These recommended levels run generally 
lower than those given in standard sources by illuminating engineers (see 
47, pp. 64-67). 

Fig. 53 Errors in Purdue Hand Precision Test under three levels of 
illumination. (From McCormick, 47, p. 60; data from McCormick 
and Niven, 48.) 
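
The idea of a critical level can be made concrete with a short Python sketch. Everything specific in it is assumed for illustration: the performance scores are invented, "no appreciable improvement" is represented by a hypothetical tolerance, and the margin of safety is treated as a simple additive allowance, which the source does not specify.

```python
def critical_level(levels, scores, tolerance):
    """Lowest illumination level above which performance does not improve
    by more than `tolerance` at any higher level tested.

    Higher scores are taken to mean better performance.
    """
    pairs = sorted(zip(levels, scores))
    for i, (level, score) in enumerate(pairs):
        # The highest level always qualifies, since nothing lies above it.
        if all(later - score <= tolerance for _, later in pairs[i + 1:]):
            return level


def recommended_level(levels, scores, tolerance, margin):
    """Critical level plus an assumed additive margin of safety."""
    return critical_level(levels, scores, tolerance) + margin


# Invented scores (e.g., a reading-speed index) at six levels in ft-c.
levels = [1, 3, 5, 10, 20, 50]
scores = [60.0, 80.0, 92.0, 99.0, 100.0, 100.5]

print(critical_level(levels, scores, tolerance=2.0))
print(recommended_level(levels, scores, tolerance=2.0, margin=5))
```

In this made-up case performance keeps rising up to 10 ft-c and then levels off, so the critical level is 10 ft-c and the recommendation, with the assumed margin, is 15 ft-c. Note that such a search requires several intermediate levels; with only 1, 10, and 100 ft-c tested, the point of leveling off could not be located.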

The amount of light needed for clear seeing varies with characteristics of 
the task and of the person. Thus, the less the contrast between objects to be 
discriminated, the higher the illumination required. Identifying black objects 
against a gray background calls for a brighter light than does identifying the 
same objects against a white background. Individual characteristics also affect 
light requirements. Persons with subnormal visual acuity or other visual de- 
ficiencies can improve their performance through the use of brighter illumina- 
tion (20, 41). Similarly, older persons need more light than younger persons. 
The acuity of older workers can often be brought up to that of younger work- 
ers by increasing illumination to the proper level (21). 

A number of studies conducted in industrial settings show the relation 
between changes in illumination and work performance (see 47, pp. 61-63; 74). 
Most of these merely compared performance records before and after the intro- 
duction of some improvement in illumination, usually an increase in the 
amount of available light. Many operations were covered by these studies, in- 
cluding letter sorting, weaving, machineshop work, inspection, assembly, and 
various other skilled or semiskilled manufacturing operations. Initial 
illumination was generally quite low, varying from less than 1 ft-c to slightly 
over 13 ft-c for different types of work. When illumination was raised to levels 
ranging from about 5 to 29 ft-c, output improved by amounts varying from 4 
to 35 per cent. Other, similar studies have shown a decrease in errors and 
spoilage or a sharp drop in accident rate following improvements in lighting 
conditions. 

This type of experimental design, of course, is subject to the Hawthorne 
effect. Rises in output may have resulted partly or wholly from better employee 
attitudes. On the other hand, if employee attitudes can be improved by slight 
increases in illumination, such changes may be fully justified on those grounds 
alone. Perhaps some of the illumination changes did increase visibility, while 
others only made the surroundings more cheerful and attractive. From a prac- 
tical standpoint, anything that increases job satisfaction or productivity is 
desirable. From a scientific standpoint, however, we would like to identify 
just what factors led to improved job performance and to learn how they 
operated. This information is needed if we want to be able to generalize 
beyond the immediate situation in which data were gathered. 

An example of the type of experimental design required for the identification 
of cause-effect relations is provided by a study of nine card-punch operators 
in the Bureau of the Census (see 47, pp. 62-63). Three card-punch 
machines were set up in each of three work zones. In Zone I, the original in- 
candescent-lighting conditions were continued unchanged. In Zone II, illu- 
mination on work surfaces was increased by painting walls, ceiling, and 
equipment; recovering floor; and cleaning light fixtures and furnishing new 
bulbs. In Zone III, luminous indirect fluorescent fixtures were installed, in 
addition to making the general renovations described for Zone II. The 
illumination levels at the machines averaged 28, 38, and 49 ft-c in Zones I, II, 
and III, respectively. Performance records were kept for nine weeks. During this 
period, each employee worked for one week at each machine in a random 
sequence. Mean production indices for Zones I, II, and III over the nine-week 
period were 43.7, 46.0, and 50.4. The difference between Zones I and II was 
not statistically significant, but that between Zones I and III was. From these 
results, the investigators concluded that for card-punch operation an 
illumination level of about 50 ft-c is desirable. 

2 [...] corresponds roughly to the light falling on a working surface [...] 
inches from a frosted 50-watt bulb (74, p. 13). 

Some industrial situations present special lighting problems. An example is 
the airplane cockpit during night flying, when it is desirable to preserve as 
much dark adaptation as possible to facilitate the pilot’s perception of stimuli 
outside the airplane. Thus the question is: How far can illumination be re- 
duced and still permit accurate reading of dials? In a laboratory experiment 
designed to help answer this question, 20 subjects with normal vision were 
tested individually with photographic reproductions of a bank of 12 dials (72). 
After a 10-minute period of dark adaptation, the subject was instructed to 
read each dial orally as rapidly and accurately as possible. The subject read 
10 cards at each of five brightness levels, corresponding to .005, .018, .022, 
.296, and 6.0 foot-lamberts.³ The sequences of illumination levels were rotated 
for different subjects to balance practice and fatigue effects. 

Performance records showed considerable improvement in speed and accuracy 
up to .022 ft-L, with no significant change above that level. At lower 
levels of illumination, both frequency and magnitude of errors increased. 
When smaller dials were substituted in a second experiment, performance 
continued to improve at higher illumination levels, the critical region for 
accuracy falling between .02 and .05 ft-L. The principal contribution of this 
investigation was to demonstrate that fairly low levels of illumination are 
adequate for accurate performance of a task such as dial reading, which calls 
for visual discriminations well above threshold value. 

Distribution of Light. Other characteristics of light besides its intensity 
affect visual efficiency. Of particular importance is the degree of uniformity 
of illumination. Poor distribution of light is likely to produce glare, which may 
cause discomfort, visual fatigue, poor performance, and accidents. Glare may 
result from the presence of direct light sources in the visual field, from marked 
brightness contrast between two surfaces, or from surfaces that concentrate 
the light in one direction instead of diffusing it (e.g., shiny metal, glossy 
paper). 

3 The foot-lambert (ft-L) is a unit for expressing the brightness reflected from a 
surface. Depending not only upon the intensity and distance of the light source but 
also upon the reflecting properties of the work surface, it is the most appropriate unit 
to use in studies of the effect of illumination on visual efficiency. One foot-lambert is 
the brightness of a perfectly reflecting surface illuminated by 1 ft-c. Since even white 
paper ordinarily reflects only about 80 per cent of the light, the foot-lamberts reflected 
by a work surface are less than the footcandles falling upon it. 

Any bright object tends to attract the eyes away from the work surface. 
Muscular effort is expended in turning the eyes back and forth between fixation 
points, as well as in the accompanying convergence and accommodation 
changes. The presence of bright spots in the visual field thus heightens visual 
fatigue, which is largely fatigue of the eye muscles. To maintain fixation on 
the work surface with a minimum of strain, the surroundings should be somewhat 
darker than the work area. At the same time, the contrast should not be 
so great as to require shifts from light to dark adaptation. It has been generally 
recommended (although without much empirical verification) that the bright- 
ness ratio should be no greater than 3:1 between work area and immediate sur- 
roundings and no greater than 10:1 between work area and more remote parts 
of the room (47, p. 71). Excessive uniformity of light may also be undesirable, 
because of its emotional effect on the worker. Such lighting may produce 
boredom and sleepiness (85). In most practical situations, satisfactory results 
can be obtained through a combination of diffuse general illumination of fairly 
low intensity and local light units near the work surface. 
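
The relation between footcandles and foot-lamberts, together with the recommended brightness ratios, lends itself to a brief worked example in Python. The reflectance values and illumination figures below are hypothetical, and the function names are illustrative assumptions; only the 3:1 and 10:1 limits and the 80 per cent reflectance of white paper come from the text.

```python
def foot_lamberts(footcandles, reflectance):
    """Brightness of a diffusing surface: incident light times the
    fraction reflected (1.0 would be a perfectly reflecting surface)."""
    return footcandles * reflectance


def within_ratio(work_brightness, other_brightness, limit):
    """True if the work area is no more than `limit` times as bright as
    the other surface. Assumes the work area is the brighter of the two."""
    return work_brightness / other_brightness <= limit


# Hypothetical room: white paper on a desk, with a dark wall farther away.
work = foot_lamberts(30, 0.80)   # paper under 30 ft-c
desk = foot_lamberts(30, 0.30)   # desk top under the same light
wall = foot_lamberts(10, 0.20)   # remote wall under 10 ft-c

print(work, desk, wall)
print("immediate surroundings within 3:1:", within_ratio(work, desk, 3))
print("remote surroundings within 10:1:", within_ratio(work, wall, 10))
```

In this invented room the desk top satisfies the 3:1 recommendation, but the remote wall is too dark relative to the work surface and exceeds the 10:1 limit.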

As examples of laboratory investigations that have demonstrated the advantages 
of diffuse lighting and glare reduction, we may consider the pioneer 
experiments of Ferree and Rand⁴ (19). In these experiments, as in most of their 
research on various aspects of illumination, Ferree and Rand administered 
visual acuity tests before and after a three-hour period of continuous reading 
under each kind of illumination. Recognizing that momentary tests of per- 
formance may fail to reveal fatigue, they required the subject to observe two 
points continuously for three minutes and to indicate when they blurred and 
when they were seen as separate. The percentage of time during which the 
two points appeared separate was taken as an index of the subject's visual 
efficiency at the time. These percentages were compared in the two acuity 
tests to determine what loss, if any, had occurred as a result of the three hours 
of reading. 
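
The index just described reduces to simple arithmetic, sketched here in Python. The durations are hypothetical and the function names are illustrative assumptions; the three-minute (180-second) observation period is taken from the text.

```python
def percent_clear(clear_seconds, total_seconds=180):
    """Percentage of the observation period during which the two points
    were seen as separate."""
    return 100 * clear_seconds / total_seconds


def acuity_loss(clear_before, clear_after, total_seconds=180):
    """Drop in the index from the test given before the reading period
    to the test given after it, in percentage points."""
    before = percent_clear(clear_before, total_seconds)
    after = percent_clear(clear_after, total_seconds)
    return before - after


# Hypothetical subject: points clear for 135 s before reading, 90 s after.
print(percent_clear(135))      # index before reading
print(percent_clear(90))       # index after reading
print(acuity_loss(135, 90))    # loss attributed to three hours of reading
```

Because the index is based on sustained observation rather than a momentary reading, a subject who can briefly "pull together" for a short test will still show a loss if the points blur for much of the three minutes.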

In one experiment, Ferree and Rand (19) compared three types of illumina- 
tion differing in diffuseness. With indirect lighting, which provides the most 
diffuse illumination, they found little loss in visual acuity in the course of 
three hours of continuous reading. With semi-indirect and especially with 
direct lighting, the drop in acuity was much sharper. This loss could be 
minimized, however, by reducing the number of lighting fixtures within the 
subject's visual field. Each light source visible to the subject represents a pos- 
sible glare spot that might attract the eyes away from the work surface. Visual 
efficiency showed the least loss when no light fixture was visible to the subject; 
the loss increased progressively when two, four, and six fixtures were visible. 

Color of Light. On theoretical grounds, pure, or monochromatic, light 
should give a clearer image than light that is a mixture of several colors, or 
wavelengths. The different wavelengths are refracted differently in passing 
through the lens of the eye and hence come to a focus at slightly different 
points. This phenomenon, known as “chromatic aberration,” produces a slightly 
blurred image. Daylight, as well as light from common artificial sources, is a 
mixture of different wavelengths. Certain types of artificial illuminants, such 
as sodium-vapor lamps, have been specially developed to provide nearly 
monochromatic light, from a narrow spectral band. Contrary to expectation, 
however, empirical studies show little or no significant differences between 
visual performance under such lights and under ordinary illumination (47, 
ch. 3; 74). 

4 Like the Gilbreths, whose work was cited in Chapter 7, Ferree and Rand were a 
husband-and-wife team that collaborated in a long series of research projects. In this 
case, however, both were psychologists. 

Nor have appreciable differences in visual efficiency been found from one 
color of light to another, except possibly for tasks at or near threshold level. 
In tasks requiring color discrimination, of course, colored lights that alter the 
surface color of objects should be avoided. Another relevant consideration is 
the affective responses of persons to differently colored lights. A greenish light 
that gives a sickly cast to the complexion has been known to make some work- 
ers ill! 

Still another factor that may influence the choice of illuminant color in 
special situations is dark adaptation. This is illustrated by the previously cited 
example of cockpit illumination for night flying. Light from the red end of 
the spectrum permits the human eye to dark-adapt faster and more completely 
and to maintain dark adaptation better than does light from other parts of the 
spectrum. Accordingly, an experiment was conducted to determine the effect 
of color on the ability to read dials in dim light (73). The general procedure 
was similar to that followed in the previously reported study on dial reading 
under different levels of illumination. Four colors, including yellow-green, 
yellow-orange, orange-red, and deep red, were investigated at each of two 
illumination levels (.01 and .1 ft-L). 

Although at .01 ft-L red light tended to be inferior to the other colors in 
terms of accuracy and speed of dial reading,⁵ at .1 ft-L it was equal or superior 
to them. It was also found that all four colored lights (which were relatively 
monochromatic) yielded better performance than did the white light used in 
the earlier experiment. The practical implication of this experiment is that red 
light, with its advantages for dark adaptation, may be used in cockpit illumina- 
tion without loss in speed or accuracy of dial reading. 


ATMOSPHERIC CONDITIONS 


Regulation of Body Heat. In the process of metabolism, the body gen- 
erates heat. When the organism is at rest, metabolic rate is slower and less heat 
is generated than when the organism is engaged in physical activity. The more 
strenuous the activity, the greater the amount of heat generated. To maintain 
its normal temperature, the body must give off excess heat. If body heat is 
dissipated too slowly, one feels uncomfortably warm; if it is dissipated too fast, 
one feels chilled. Body heat is dissipated chiefly through evaporation of per- 
spiration, radiation of thermal energy from the body to cooler surfaces in its 
environment, and convection of heat by the air around the body. 

The cooling power of a particular environment depends upon other char- 
acteristics besides its temperature. There is a sound factual basis for the pop- 
ular cliché that "It's not the heat, it's the humidity" which often causes dis- 
comfort. High humidity reduces the rate of evaporation of perspiration and 
thus decreases the effectiveness of an important mechanism for the dissipation 
of bodily heat. The role of humidity can be illustrated by the reactions of a 
group of men exposed for one hour to different combinations of temperature 
and humidity (see 47, pp. 232-233). When the humidity was only 10 per 
cent, temperatures as high as 140° Fahrenheit were judged to be tolerable. On 
the other hand, when the humidity reached 80 per cent, a temperature of 
110° proved intolerable. Similar interactions of temperature and humidity 
have been found when the ability to carry on physical work was the criterion 
(47, pp. 233-234). 

5 .01 ft-L falls below the critical illumination level identified in the earlier study (72). 

Circulation of the air also affects its cooling power. The effectiveness of 
fans derives from this principle. Dissipation of bodily heat is accelerated if 
the blanket of warm air that surrounds the body is carried away and cooler 
air replaces it. However, if air temperature is near or above body temperature 
and humidity is high, fans will be of no avail. 

At one time it was believed that the discomfort and other ill effects of poor 
ventilation resulted from changes in the chemical composition of the air we 
breathe. Under ordinary circumstances, however, changes in the proportion of 
oxygen, carbon dioxide, and other chemicals in the air are too slight to affect 
the organism (see 47, p. 243). Even if a crowd of people remains for several 
hours in an unventilated room, the proportion of oxygen will still be well 
above the minimum requirement and the proportion of carbon dioxide well 
below the safe maximum. The discomfort experienced under these circum- 
stances arises from the body’s inability to dissipate heat properly. This fact was 
demonstrated in a classic experiment in which subjects in an ill-ventilated 
room breathed fresh air conveyed through a tube (see 80, pp. 9-12). Under 
these conditions, they experienced the same discomfort and showed all the 
physiological symptoms associated with poor ventilation. When the situation 
was reversed and they breathed stale air while sitting in a well-ventilated 
room, they experienced no discomfort and showed no physiological changes. 

Laboratory studies (80) have also shown that output in physical work de- 
clines sharply under conditions of poor ventilation but that mental work is 
relatively unaffected. Such findings are understandable when we consider 
that much more heat is produced during physical than during mental work. 
When subjects feel uncomfortable because of poor ventilation, however, they 
are less willing to carry on mental work. Hence, when motivation is not very 
high, or when work continues over long periods of time, output may slacken 
off in mental tasks too. 

Ventilation Studies in Industry. Several early investigations by the Industrial 
Fatigue Research Board of Great Britain were concerned with atmospheric 
conditions in factories and mines (see 1). Some of these studies 
used the kata thermometer, an instrument that records the cooling power of 
the air resulting from the combined effects of temperature, humidity, and rate 
of air motion. By this method, the cooling power of the air in different coal 
mines or at different depths within a mine was determined. The results 
showed that, as the cooling power decreased, the number and severity of 
accidents rose, the duration of voluntary rest pauses increased, and output 
decreased (82). In another study conducted in coal mines, absenteeism was 
found to be related to temperature (83). As temperature rose above 70°F, 
the percentage of working time lost on account of employee illness increased. 

In still another investigation (63), the frequency of minor accidents in 
three munitions plants was plotted against daily temperature over a period of
about ten months. The minimum number of accidents occurred at a temperature
of 67.5°F. As the temperature rose to a high of about 78°F or dropped
to a low of about 48°F, accident rate increased by as much as 30 per cent. The 
physical discomfort resulting from any pronounced deviation from the opti- 
mum temperature is likely to distract workers and hence lead to accidents. In 
addition, at high temperatures perspiration makes it more difficult to grasp 
tools properly; and, at low temperatures, stiffness of the hands interferes with
coordination of movement.

The effect of installing fans was investigated in a textile mill, where tem- 
perature and humidity had to be kept fairly high to prevent breakage of 
thread (88). Fans were operated on alternate days over a period of six weeks. 
Analysis of output records showed that on the days when the fans were in 
operation productivity was significantly higher and waste lower than on the
other days.

A more recent study (54) was concerned with the performance of telegraph
operators under extreme climatic conditions. The average number of errors 
made by 11 operators in receiving Morse code was compared under different 
combinations of temperature and humidity. The operators had been acclima- 
tized to the atmospheric conditions under investigation for periods of 7 to 11 
weeks prior to the experiment. Results were reported in terms of effective 
temperature, a commonly used index combining into a single value the effects 
of temperature, humidity, and air velocity on the sensation of warmth felt by 
the human body (34). Any given effective temperature (ET) corresponds to 
the temperature of still, saturated air (100 per cent humidity) that would give 
the same perceived warmth. Thus an ET of 80° is equivalent to a temperature 
of 80°F with 100 per cent humidity and completely stagnant air. It follows 
that 80°ET would be considerably more uncomfortable than a temperature of 
80° experienced under ordinary conditions of humidity and air movement
encountered in everyday life.

Figure 54 shows average number of errors per man-hour with effective
temperatures of 79° and above. It can be seen that when ET exceeds 95°
there is a sharp rise in errors. It is also noteworthy that the increase in errors
is greater during the third hour than during the first hour of work, indicating
the cumulative effects of working for long periods of time under such
uncomfortable conditions. Analysis of individual records showed that deterioration
under adverse atmospheric conditions was greater for operators who were
initially less skilled than for the more highly skilled.

Fig. 54 Relation between effective temperature and average number of
errors in Morse code receiving. (Data from Mackworth, 54, p. 148.)
[Graph not reproduced: average number of errors per man-hour plotted against
effective temperature, with separate curves for the 1st, 2nd, and 3rd hours.]

Weather and Related Conditions. The various atmospheric variables con- 
sidered in connection with ventilation are obviously related also to the broader 
and more pervasive conditions subsumed under weather, seasons, and climate. 
Theories regarding the effects of climate upon human behavior and upon the 
development of civilizations date back to Aristotle and probably earlier. Dur- 
ing the first quarter of the present century, Huntington (38) wrote extensively 
on this subject and conducted several field surveys to test his hypotheses. Of 
particular interest were his analyses of seasonal variations in the output of
piece-rate workers in different latitudes (38, ch. 4). For example, 410 oper- 
atives in three Connecticut hardware factories, surveyed over a period of four 
years, showed production peaks in the spring and fall, with a steep production 
drop in midwinter and a smaller drop in midsummer. One year when the sum- 
mer was exceptionally cool, the midsummer drop in output failed to appear. 
These factories were chosen because working conditions were unusually uni- 
form throughout the year and there seemed to be no major external constraints 
affecting production. 

When the output curves of 120 workers in South Carolina cotton factories 
were similarly analyzed, the spring and fall peaks were found to be farther 
apart, both falling closer to the winter months than was true in the Connecti- 
cut survey. Moreover, the midsummer production drop was now steeper than 
the midwinter drop. These trends became more pronounced in the output 
curve of 2,300 cigar makers in Tampa, Florida. In this group, there was a 
single output peak in midwinter, as well as a steep midsummer drop. In this 
connection, it is also relevant to note that, in a study of tinplate mills con- 
ducted by the Great Britain Industrial Fatigue Research Board (81), a 13 
per cent summer drop in output was found in a plant with no artificial venti- 
lation and poor natural ventilation, while the summer drop was only 3 per cent 
in another plant with artificial ventilation that provided good air circulation. 

In another early study, Dexter (14) compiled a mass of statistical data on 
the relationship between daily weather conditions and the frequency of such 
occurrences as natural deaths, crimes of violence, suicides, absences from 
school, and clerical errors of bank clerks. Dexter interpreted the various re- 
lationships he found as suggesting that one’s general vitality increases on 
moderately warm, clear days and decreases on cold, cloudy, and rainy days. If 
weather does have any significant effects upon human behavior, it is unlikely 
that they can be summed up as simply as this. The interpretation of survey 
data of this sort is difficult because of the large number of uncontrolled
factors that may influence the results.

Recently there has been a revival of interest in the relation between weather 
conditions and human behavior. A research group established in Germany, for 
example, has been gathering data on the effects of many different meteorologi- 
cal variables on the human organism (60). These investigators differentiate 
between periodical meteorological variations, corresponding to daily and sea- 
sonal cycles, and aperiodical rhythm-changing processes, which they believe 
to be more stressful for the organism. The effects of storms would be an
example of the latter changes. Surveys of large groups conducted by these
investigators showed that such aperiodic weather changes tended to be associated
with a rise in frequency of industrial accidents, an increase in number of visits
to plant dispensaries for medical aid, a lengthening of reaction time, and a
lowering of critical flicker frequency.⁶ Studies of “tropical fatigue,” conducted
during and after World War II, represent another area of research in psycho- 
climatology (see 16, 18). Still further evidence of interest may be seen in the 
fact that the American Institute of Medical Climatology, established in 1958, 
contains a section on psychology. 

Mention may also be made of recent research on the possible biological and 
behavioral effects of atmospheric ions (25). “Ionization” refers to the state of 
electric charge of the atmosphere. All atmospheric particles can be described as 
either neutral or positively or negatively charged. The concentration of posi- 
tive or negative ions in the air can be radically altered through such factors as 
heat sources, operating electronic equipment, or storms. In a number of experi- 
ments conducted by biologists, certain behavioral effects of ion concentration 
have been observed incidentally. None of the behavioral effects, however, has 
as yet been demonstrated under adequately controlled conditions. 

One psychological study designed specifically to test alleged effects of ion 
concentration on reaction time obtained negative results (65). A number of 
commercially available ion generators, or “ionizers,” have been marketed on 
the supposition that an increase in negative-ion concentration is conducive to 
improved vitality and feelings of well-being. The previously cited study (65) 
also lent no support to such claims, since subjects were unable to identify in- 
creased concentration of positive or negative ions and their reported feelings 
during experimental and control sessions were indistinguishable. Any conclu- 
sions regarding the effects of atmospheric ions on behavior must await further 
research. 

Extreme Conditions. Although relatively restricted in its applications, con- 
siderable research has been done on more extreme atmospheric conditions than 
those discussed in the preceding sections (see 47, ch. 9). Among the factors in- 
vestigated are extreme cold, as found in arctic regions; toxic gases, radioactive 
material, and other impurities in the atmosphere; rapid changes in barometric 
pressure, such as might be encountered during escape from a submarine or in 
a sudden emergency loss of pressure in high-altitude pressurized aircraft; and 
oxygen deficiency, or anoxia, as found at high altitudes. 

Of all these conditions, the one whose psychological effects have been most
thoroughly investigated is anoxia⁷ (47, ch. 9; 49; 50; 51, ch. 3). By placing
subjects in experimental chambers, it is possible to expose them to atmospheric 
conditions corresponding to different altitudes, to present these conditions in 
random sequence, and to intersperse them with control sea-level sessions in 
order to rule out the effects of suggestion. Although individuals differ widely 
in their responses to oxygen deficit, behavioral effects of anoxia usually begin 
at an altitude of approximately 10,000 feet and increase in extent and severity
with increasing altitude. Loss of consciousness and collapse generally occur at
about 20,000 feet.

6 Critical flicker frequency (CFF) is the rate at which a flickering light is first seen
to fuse. It is believed that CFF may serve as an index of the efficiency of functioning
of the nervous system.

7 Strictly speaking, “anoxia” means oxygen lack, and the correct term for oxygen
deficit is “hypoxia.” In common usage, however, anoxia designates any degree of oxygen
deficit, and it is so employed in this text because of its greater familiarity.

The individual is usually unaware of the onset of anoxia, a fact that makes 
it all the more important that persons who may be exposed to oxygen deficit 
take proper precautions in advance. The first symptoms are often emotional: 
the individual may become irritable and belligerent or—more often—euphoric 
and jolly. These initial changes are not unlike the effects of mild doses of 
alcohol. With increasing altitude, there is progressive loss of sensory and motor 
functions. Night vision may be impaired even below 10,000 feet in some 
persons. Visual acuity, color discrimination, and other visual functions are 
among the first to decline. Hearing is affected later. Complex motor processes 
are disrupted earlier than simpler activities. Handwriting deteriorates, and 
speech becomes slurred and unintelligible. At higher altitudes reaction time is 
lengthened. Intellectual functions are likewise affected in the order of their 
complexity. The more complex functions requiring memory, reasoning, and 
judgment are lost first; at higher altitudes, even such simple functions as can- 
celing letters deteriorate. 

Rapid ascent to very high mountains, as in a railroad, produces anoxic 
effects similar to those described above. Slower ascent, as in ordinary mountain 
climbing, produces a rather different set of symptoms, which may include ill- 
ness and depression (50). Persons reared at high altitudes show considerable 
physiological adaptation to such altitudes. This is true of the residents of cer- 
tain villages in the Andes, who normally live at altitudes close to 15,000 feet. 
Such persons also exhibit much greater tolerance to acute and severe anoxia 
when experimentally exposed to simulated altitudes beyond 30,000 feet (79). 


ENVIRONMENTAL FACTORS IN SPACE TRAVEL 


In their approach to problems of space travel, psychologists have been able to 
apply well-established research methodologies and other procedures previously 
developed in other contexts. Personnel selection and training are an important 
aspect of the space program and one to which psychologists have made major 
contributions (see 30; 68, ch. 8). The advantages of training devices and
simulators are especially evident in the preparation of astronauts for both routine
and emergency tasks that they may have to perform in the space vehicle. Re- 
search on human factors in equipment design and the development of effective 
man-machine systems, to be discussed in the next chapter, are basic to the suc- 
cessful operation of manned spacecraft. Within the context of the present 
chapter, it is the extreme environmental conditions encountered in space travel 
that are of primary interest. Although several environmental factors have 
been investigated in this connection, those that have stimulated the greatest 
amount of research are weightlessness and isolation.

Weightlessness and Related Gravitational Phenomena. The experience
of weightlessness, resulting from zero gravity, is one of the most spectacular 
problems with which the space traveler must cope (see 2, pp. 422-434; oo 
30; 33; 44; 68). Human posture and locomotion are adapted to the earth’s 
normal gravitational pull. Under highly reduced or zero gravity, locomotion is
extremely difficult, movements are erratic and uncoordinated, and perception
becomes disoriented. Circulatory changes, nausea, and other physiological
disturbances may also occur. Prolonged exposure may lead to muscular weakening
from disuse. Return to normal gravity conditions may be accompanied by
marked debility and gross disruption of psychomotor performance.

8 Modern high-altitude aircraft are regularly equipped with [words illegible in
source] produce the atmospheric conditions prevailing at low altitudes. Under such
conditions, of course, the symptoms of anoxia do not occur.

In short space flights in which the operator is strapped to the seat, weight- 
lessness does not significantly impair performance. Any initial disorientation 
occurring under these conditions can be overcome with practice, especially if 
visual cues are available. In the first United States manned orbital space 
flight, for example, in which the pilot remained weightless for approximately 
4½ hours, manipulation of controls was unaffected and no adverse
physiological or psychological effects were observed (61, pp. 99, 119, 136). In a longer
flight, one of the Russian astronauts did report symptoms of motion sickness, 
but others experienced no such ill effects during even longer periods of weight- 
lessness. The problem of weightlessness with a free-floating subject, however, 
has not yet been satisfactorily solved (30, 69). Empirical data on the effects of 
prolonged weightlessness are also very meager. 

For experimental purposes, weightlessness can be produced by flying an air- 
craft in a parabolic trajectory so that centrifugal force exactly balances the pull 
of gravity. This technique has been utilized in most of the research on the 
effects of zero gravity with both animal and human subjects. By this method, 
however, weightlessness can be maintained only for periods of a minute or less. 
Because of this limitation, some investigators have experimented with other 
situations that simulate at least some aspects of weightlessness. For instance, 
subjects have been floated in water-filled tanks for several hours at a time 
while the effects on both physiological condition and task performance were 
measured (28, 33, 44). 

Still another approach to the problem has been concerned with the effects of 
slow rotation upon human performance (13, 29). This sort of experiment is of
interest because artificial gravity can be produced by rotating the spacecraft or 
space platform around its own gravitational center. The difficulties of weight- 
lessness might thus be overcome, provided the rotation itself did not introduce 
new disturbances. Preliminary results suggest that with adaptation any initial 
deleterious effects of such rotation tend to disappear.
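
The physical relation behind rotation-produced artificial gravity can be sketched in a few lines. The example below is only an illustration, not a description of any apparatus used in the studies cited: it assumes the standard relation a = ω²r between centripetal acceleration, angular velocity, and distance from the axis of rotation, and the function names and the 10-meter radius are hypothetical.

```python
import math

G0 = 9.80665  # standard gravity in m/s^2 (assumed reference value)


def spin_rate_rpm(radius_m, target_g=1.0):
    """Revolutions per minute at which the centripetal acceleration at
    radius_m equals target_g times standard gravity (a = omega^2 * r)."""
    if radius_m <= 0 or target_g <= 0:
        raise ValueError("radius and target g must be positive")
    omega = math.sqrt(target_g * G0 / radius_m)  # angular velocity, rad/s
    return omega * 60.0 / (2.0 * math.pi)


def artificial_g(radius_m, rpm):
    """Centripetal acceleration, in multiples of standard gravity, felt at
    radius_m when the vehicle turns at rpm revolutions per minute."""
    omega = rpm * 2.0 * math.pi / 60.0  # convert rev/min to rad/s
    return omega ** 2 * radius_m / G0


if __name__ == "__main__":
    radius = 10.0  # hypothetical distance from the axis of rotation, meters
    rpm = spin_rate_rpm(radius)
    print(f"{rpm:.2f} rev/min gives {artificial_g(radius, rpm):.2f} g at {radius} m")
```

Because the required rate falls with the square root of the radius, a larger vehicle can turn more slowly for the same artificial gravity, which is one reason the tolerance of human subjects for slow rotation is of practical interest.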

A related problem pertains to the high accelerative forces experienced dur- 
ing launch and reentry. In a sense, this condition is the opposite of weightless- 
ness, since the gravitational force acting upon the body is increased many 
times (see 47, ch. 8). High acceleration can be simulated in a human
centrifuge. This device has been used experimentally to determine the amount of
accelerative force that can be safely tolerated for different periods of time, the 
optimal body position and other conditions that affect such tolerance, and the 
effect of different amounts of force upon the subject's ability to operate con- 
trols and perform other tasks (2, pp. 464-477; 57; 84). Figure 55 shows an 
astronaut preparing to enter the gondola of the human centrifuge. Within this 
gondola were installed the display and control panels of the Mercury space 
capsule, thus permitting direct observation of the effects of these environ- 
mental stresses upon performance in space flight. The centrifuge has also been 
effectively employed for purposes of adaptation and training in preparation for 
actual space flight. 


Fig. 55 Human centrifuge used to simulate accelerative stresses
encountered during lift-off and reentry in space flight. (Courtesy of
National Aeronautics and Space Administration.)

It should be noted that many of the psychological problems investigated
with reference to propeller and jet aircraft are also relevant to space travel. An
example is to be found in the research on pilot disorientation (10, 12). In the
flight situation, familiar sensory cues are often encountered in unfamiliar
contexts and hence misperceived. As a result, the position and motion of the
aircraft may be misjudged. The gravity receptors (otolith organs) located in the
inner ear play an important part in our spatial orientation. In the absence of 
suitable visual cues, these gravity receptors may lead to disorientation because 
of the unusual conditions of gravity and acceleration encountered in flight. 
Some accidents attributed to “pilot error” have probably resulted from such
spatial disorientation. 

When in flight, the human organism is subject to a number of illusions not 
occurring in earthbound conditions. For example, a fixed visual target viewed 
in the dark will appear to rotate as a function of the acceleration or decelera- 
tion of the airplane (9). The best safeguards against illusions and spatial 
disorientation are provided by knowledge regarding the occurrence of these 
phenomena and by adequate training in the use of instruments and other 
dependable checks. 

Isolation. The conditions discussed in the preceding section illustrate sev- 
eral kinds of physiological stress that may arise in space travel. In addition, 
psychological stress may be induced by various aspects of the situation, such as 
unfamiliarity, danger, and isolation. Considerable research has been conducted 
on the effects of isolation (see 2, pp. 409-421; 86). The problem of isolation is
of concern not only in space travel but also in long-submerged submarine 
cruises, assignments to arctic bases, and a number of similar situations. Isola- 
tion may involve several factors, chief among which are confinement, detach- 
ment or separation, and sensory deprivation. Although these factors are inter- 
related and often occur together, they can be considered independently for 
purposes of analysis. 

The effects of confinement on the individual's psychological condition and 
on his performance have been studied in both laboratory and field situations 
(7, 17, 30, 31, 33, 62, 64). The psychological stress induced by confinement
arises partly from restriction of movement and partly from being enclosed in a 
small space, a condition which some persons find intrinsically oppressive. 
When several persons are confined together, as in the case of a work crew, 
there are additional problems of interpersonal relations. Lack of privacy and 
inability to escape from an unpleasant social situation are potential sources of 
stress. 

In certain types of isolation, the individual may experience feelings of de- 
tachment, separation, and aloneness. Such feelings are illustrated by the 
“break-off” phenomenon described by some high-altitude fliers (see 11; 71, pp.
159-173). In an investigation of this experience, 137 military jet pilots were
interviewed (11). Of this group, 35 per cent reported that they had experi- 
enced the break-off effect, and an additional 12 per cent said they had heard 
others describe it. Those who experienced it characterized the break-off effect 
as a feeling of being isolated or separated physically from the earth. Feelings of 
loneliness often accompanied this experience. Some pilots reported exhilaration 
and a sense of power; others, anxiety and insecurity. The break-off effect oc- 
curred most often when the pilot was flying alone, at a high altitude, and 
with little to keep him busy. It appeared to be unrelated to the pilot's flight 
experience but seemed to be influenced by personality characteristics. 

Sensory deprivation refers to a reduction in the absolute level, variability, 
or patterning of sensory stimulation. In milder forms it produces boredom and 
restlessness; in more severe forms it may lead to hallucinations and other seri- 
ous aberrations. Although research on the effects of sensory deprivation is of 
recent origin, its implications have aroused interest in many connections, from 
child development to psychotherapy, and from “brainwashing” to space travel 
(see 22, 71). From a theoretical standpoint, it has been suggested that variety 
of stimulation contributes to the activation level, or “alertness,” of the organ- 
ism and that activation level is in turn related to the effectiveness of behavior. 
Underlying neurological mechanisms have been proposed to explain this 
phenomenon (35, 71). 

Under certain circumstances, highly repetitive visual tasks may induce sleep 
or mild hypnosis. Similarly, when driving on a superhighway with little to do and
and few objects to see along the roadside, some drivers become drowsy and 
sleepy. In long-distance truck driving, hallucinations are a recognized accident 
hazard (52, pp. 124-125). In experimental studies of sensory deprivation, 
adult volunteers have usually been confined individually in a small enclosure 
providing a minimum of visual, auditory, and touch stimuli for periods
ranging from a few hours to several days (15, 23, 58, 67, 71, 87, 90). Among the
observed effects are perceptual distortions and cognitive disorientations, major
disturbances of time estimation, hallucinations, inability to concentrate, decline
in test performance, and unpleasant emotional states. Some subjects voluntarily 
terminate the experiment ahead of schedule because they find it intolerable. 
It is noteworthy, too, that subjects in such isolation experiments characteristi- 
cally show an abnormal concern with time (58). 

Mention should also be made of the autokinetic illusion, long familiar in the 
psychological laboratory. When an observer gazes steadily at a small, dim 
light in an otherwise dark room, the light soon appears to move in an irregular 
path. This illusory movement has practical importance in night flying and may 
account for some aircraft accidents (27). 

Individuals vary widely in their reactions to all aspects of isolation. Such 
individual differences are especially marked with regard to emotional responses. 
In general, persons who adjust well to the psychological stresses of isolation 
are those who adjust well in other everyday-life situations. Motivation is an im- 
portant factor in counteracting the adverse effects of isolation. Personnel se- 
lection is thus a major way to prevent isolation effects. Habituation and train- 
ing, including a knowledge of expected phenomena and specific ways of cir- 
cumventing them, are also effective. Isolation stress may be further reduced by 
maintaining communication and by providing activity and stimulation for the 
operator.⁹ When a crew must work together in prolonged confinement, the
use of sociometric techniques in assigning members to the group may be help- 
ful (see ch. 6). 


NOISE 


In discussions of the effects of noise on human behavior, the term "noise" has 
commonly been used to mean "unwanted sound" (3; 47, ch. 8). Regardless of 
its physical properties, noise in this context customarily refers to sounds that 
are injurious, disruptive, distracting, or annoying. 

Noise and Hearing Loss. There is abundant evidence that exposure to
loud noises produces hearing loss, although many of the details regarding the 
characteristics of the sound stimuli and the nature of the effect are still to be 
worked out (3; 47, ch. 8). It is well established that several years’ exposure 
to a high noise level, as in the work of boilermakers, riveters, and train crew- 
men, may result in permanently impaired hearing. The frequency and severity 
of such impairment varies with the loudness of the sounds in the work en- 
vironment, as illustrated by the data from different departments of an aircraft 
plant, shown in Figure 56.

Some of the hearing loss resulting from noise is temporary. Partial or total 
recovery may occur within periods varying from a few hours to several months. 
The extent and rate of recovery probably depend upon the individual, the 
nature of the sound, and the duration of exposure. Estimates of the lower 
bounds of harmful noise level vary widely, but the majority cluster at or 
slightly above 100 decibels (db) for continuous exposure (47, p. 191). This 
intensity corresponds roughly to the maximum noise obtained from riveting or
from a New York subway train (8, p. 112). Intermittent loud noises, as from
gun blasts, may also cause either temporary or permanent hearing loss.

9 As an illustration of the successful application of all these procedures, it is
noteworthy that no evidence of sensory deprivation or break-off phenomenon was observed
by the astronaut John H. Glenn, Jr. during the first American manned orbital space flight
(61, p. 99).

Fig. 56 Percentage of workers in four departments of an aircraft plant
with "normal" hearing versus those with hearing loss. (Data from
MacLaren and Chaney, 55, p. 112.)

Noise and Auditory Communication. In work requiring the proper
response to auditory cues, noise may interfere directly with task performance.
Considerable research has been done on the masking of sounds by different 
kinds of noise (47, pp. 147-152). The effect of such masking is to raise the 
auditory threshold for various sounds. We can thus determine how much the 
loudness of a sound signal or other auditory stimulus must be increased for 
the stimulus to be just barely audible against the given noise. The task may 
call for discrimination of two or more auditory signals, as in underwater sonar
operations or in certain kinds of blind flying (47, pp. 174-177). In this con- 
nection, it has been found that the smallest noticeable difference between the 
signals depends upon the ratio between signal intensity and noise intensity. 
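
The ratio between signal intensity and noise intensity is conventionally expressed on the same decibel scale used above for noise levels. The following sketch is offered only as an illustration of that arithmetic, assuming the standard definition of the decibel as ten times the common logarithm of an intensity ratio; the function names and sample values are hypothetical and are not taken from the studies cited.

```python
import math


def intensity_ratio_db(signal_intensity, noise_intensity):
    """Signal-to-noise ratio in decibels, computed from two sound
    intensities expressed in the same physical units."""
    if signal_intensity <= 0 or noise_intensity <= 0:
        raise ValueError("intensities must be positive")
    return 10.0 * math.log10(signal_intensity / noise_intensity)


def level_difference_db(signal_level_db, noise_level_db):
    """Signal-to-noise ratio in decibels when both sounds are already
    given as decibel levels relative to the same reference."""
    return signal_level_db - noise_level_db


if __name__ == "__main__":
    # Hypothetical values: a signal 100 times as intense as the noise.
    print(intensity_ratio_db(100.0, 1.0))
    # Hypothetical values: a 70-db signal heard against 85-db noise.
    print(level_difference_db(70.0, 85.0))
```

A negative result simply means that the noise is more intense than the signal, the condition under which masking is most severe.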

A problem of widespread practical importance concerns the intelligibility of
speech under different noise conditions. Whether in face-to-face contacts, over 
the telephone, or in an “intercom” system, many situations demand accurate 
voice communication. Accordingly, extensive research has been devoted to 
testing the effects of noise on the understanding of speech under different 
circumstances. Attention has also been given to ways of improving
communications that must be carried out under adverse noise conditions (47, ch. 7). For
example, the substitution of a set of easily identifiable words (like “Roger”)
for letters improves the intelligibility of the message itself. Training personnel
in communication techniques that have proved effective also helps. It has been
found, for instance, that under noisy conditions intelligibility improves if the 
pauses between syllables and words are shortened and the speech sounds them- 
selves are prolonged. 

Noise as Distraction. Consider the following statements: 


"I can study much better with the radio going full blast —commercials and all." 

"I never turn the radio on myself when I want to work, but it doesn't bother me if
my roommate turns it on. I just ignore it." 

"I simply can't work in the library. With people walking around or whispering, it's 
impossible to concentrate." 

"How can New Yorkers ever get any work done? Trucks rumbling past, sirens 
screaming, buildings going up or coming down, streets periodically torn up—you 
can't hear yourself think!" 


These represent a few of the many opinions people express about the effect of 
distraction on work. What do the facts show? 

Any attention-arousing stimulus that is irrelevant to the task at hand may 
serve as a distractor. Besides noise, common sources of distraction include the 
presence of other persons, anything in motion, bright lights or vivid colors, 
internal sensations of pain or discomfort, excesses of temperature, pleasant 
thoughts, and worries. In industrial settings, offices, and other work situations, 
however, noise is probably the most common distractor. Hence investigations 
of distraction have usually been concerned with noise. The results would 
probably be quite similar if other distractors were used. 

Research on the effects of noise on performance has yielded many appar- 
ently inconsistent results. This does not mean that the question is unanswer- 
able, but rather that the answer is complex. Let us begin by noting the fact 
that laboratory experiments often show no decline in gross output measures, 
even when fairly drastic noise distractions are introduced. In one experiment, 
two groups of students took an intelligence test under normal conditions and 
were retested after six weeks with a parallel form of the test. While the control 
group took the retest also under normal conditions, the experimental group 
was subjected to a variety of distractions, including phonograph records, 
whistles, bells, buzzers, shop machinery, a spotlight, and students performing 
stunts in the room or marching in step on the floor above G6). With all this 
disturbance, the experimental group showed only a little less improvement on 
the retest than did the control group, the difference failing to reach statistical 
significance. 

One of the first clues to an understanding of such findings was provided by 
an early laboratory study (59). While subjects were engaged in translating 
coded material on a typewriter, several loud noises were introduced unex- 
pectedly. Although output did not differ significantly during quiet and noisy 
periods, what did differ was the amount of pressure the subjects exerted on the 
typewriter keys. This pressure was recorded through a device attached to each 
key. When working under noisy conditions, the subjects pressed harder on the 
keys than when working under quiet conditions. When working against dis- 
traction, they also tended to read the numbers and letters aloud or at least 
move their lips. Such accessory reactions probably helped them to concentrate 
upon the task, much as one might do when trying to read while people are 
talking nearby. In their efforts to compensate for the distraction, subjects 
sometimes overcompensate and actually perform better under the more stressful 
conditions. In so doing, however, they use more energy. Hence work under 
distracting conditions may be more costly to the organism. 

This hypothesis was supported by later experiments providing more direct 
measures of energy cost (24, 32, 42). In one study (24), subjects added 
columns of figures under quiet and noisy conditions. A system of rotation was 
used to balance out practice and other cumulative effects. Although the intro- 
duction of noise caused a temporary drop in number of correct additions per 
work period, output gradually reached the same level observed during quiet 
periods. During noisy periods, however, the subjects used more energy, as in- 
dicated both by a respiration calorimeter and by measures of muscle-action 
potentials. Figure 57 gives the average results for action potentials from the 
four limbs. The graph shows that muscular tension was greater under noisy 
than under quiet conditions. Over the 12-day period, however, muscle-action 
potentials decreased until they reached approximately the level of the quiet 
periods. Essentially the same results were obtained with regard to oxygen 
consumption. 

Fig. 57 Average magnitude of muscle-action potentials on successive 
days of work under quiet and noisy conditions. (Adapted from 
Freeman, 24, p. 358.) 


The above experiment thus demonstrates a further point, namely, physiological 
adaptation to noise with continued exposure (47, p. 199). It seems that, when 
unfamiliar noises are first introduced, they produce a startle reaction that 
may interfere with work. When well motivated, subjects quickly 
bring output up to their normal level by exerting more effort. With time, 
however, the noise is no longer distracting, and energy expenditure itself drops 
back toward a normal rate. Whether such adaptation is partial or complete 
cannot be ascertained without more evidence. It may be that work under noisy 
conditions will always remain at least a little more costly than work under 
quiet conditions. Moreover, the amount and rate of adaptation undoubtedly 
vary with the intensity of the distraction, the nature of the task, and the in- 
tellectual and emotional characteristics of the individual. 

The degree of annoyance engendered by a noise, as well as its effects on 
output, depends upon several characteristics besides its loudness. Intermittent 
or irregular noises are more disturbing than steady noises; and higher pitches 
are more disturbing than those in the middle or lower ranges (3, pp. 156-157; 
43; 47, pp. 202-203). In many situations, the disruptive effects of a distraction 
result not so much from the intrinsic properties of the stimulus as from the 
individual's emotional response to it. This is particularly true when the dis- 
traction is caused by some inconsiderate action on the part of another person, 
such as supervisor, co-worker, fellow student, or neighbor. 

It should also be noted that, even when subjects are not working on any 
assigned task, mere exposure to noise produces an increase in muscle tension 
and other physiological changes similar to those occurring during emotional 
excitement (see 3, pp. 154-155; 47, p. 199). Under these conditions, too, 
adaptation effects have been observed. 

That the effects of distraction on output depend upon the nature of the 
task has been repeatedly demonstrated. Noise has little or no effect upon the 
performance of highly automatic tasks requiring a minimum of attention (3, 
5). The effects of noise likewise depend upon the amount of previous experi- 
ence the individual has had with the task (39; 56, p. 74). Like extremes of 
temperature and other environmental stresses, noise is less disruptive the more 
thoroughly learned the performance. 

Research with vigilance tests suggests that the detrimental effects of noise 
are most pronounced in tasks requiring flexibility of attention, as when a 
single operator monitors several indicators (see 56, p. 74). Laboratory experiments 
in general indicate that the principal effect of noise is to increase the 
frequency of momentary lapses of efficiency (5). While output rate and other 
gross indices of performance may remain unaffected, errors and accidents are 
thus likely to increase. It will be recalled that fatigue has a similar effect on the 
performance of tasks demanding a high level of attention (Chapter 7). All 
such stressful conditions probably affect behavior in a similar way. 

A different approach is exemplified by McBain’s recent application of the 
concept of arousal to the noise problem (46). Considering the joint effects of 
environmental noise and task uniformity, McBain attributes monotony to a 
total work situation that offers little variation and hence little opportunity for 
arousal. This hypothesis was suggested in part by the research on sensory 
deprivation discussed in the preceding section. For maximal effectiveness of 
performance, McBain argues, the degree of arousal should be at an optimum 
level, which probably varies with the individual. Either too little or too much 
arousal will have deleterious effects on performance. He also observes, however, 
that noise external to the task at hand should have a minimum of “intelligibility” 
for the individual, in order to minimize its distracting power. In 
other words, it should have little meaning for the individual. Parenthetically, 
it might be added that this concept of intelligibility would help to explain why 
the student whispering across the library table may prove more distracting 
than the drilling, pounding, and crashing noises from the construction job 
across the street or why a political speech on the radio is more distracting than 
music. 

In an experiment to test the arousal hypothesis, McBain chose a uniform 
but attention-demanding task, requiring the subjects to hand-print seven pairs 
of letters in a continuous sequence at a mechanically paced rate. Each work 
session lasted 42 minutes. A series of unintelligible noises varying widely in 
intensity and content was employed during the noisy sessions. McBain notes 
that, in many laboratory studies of noise, highly homogeneous noises have been 
employed. Such noises may simply increase the total monotony of the work 
situation. A variable noise, he maintained, should improve the performance 
of an intrinsically monotonous task. The subjects were 26 members of the 
Canadian Air Force. A counterbalanced order of quiet and noisy sessions was 
followed with two equated subgroups. 
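
The counterbalanced order just described can be sketched in a few lines of Python. This is only an illustration of the general AB/BA scheme, not a reconstruction of McBain's procedure: the function name `counterbalance` is hypothetical, and random assignment is used here as a stand-in for whatever matching produced the two "equated" subgroups.

```python
import random


def counterbalance(subject_ids, conditions=("quiet", "noisy"), seed=0):
    """Split subjects into two equal subgroups that receive the two
    conditions in opposite orders (an AB/BA design).

    Assumption: subjects are assigned at random; the original study
    equated its subgroups, which this sketch does not attempt.
    """
    ids = list(subject_ids)
    if len(ids) % 2:
        raise ValueError("need an even number of subjects")
    rng = random.Random(seed)
    rng.shuffle(ids)
    half = len(ids) // 2
    first, second = conditions
    schedule = {}
    for sid in ids[:half]:
        schedule[sid] = [first, second]   # e.g. quiet session, then noisy
    for sid in ids[half:]:
        schedule[sid] = [second, first]   # e.g. noisy session, then quiet
    return schedule


# 26 subjects, as in the experiment described above.
schedule = counterbalance(range(1, 27))
```

Because each condition occurs equally often in the first and second position, practice and other order effects are spread evenly over the quiet and noisy sessions rather than favoring one of them.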

Although this experiment included several parts, only one will be reported 
here. This part yielded the most clear-cut results and is also the most relevant 
to the present topic. The interested reader is urged to examine the original 
article. For the present purpose it will suffice to note that the subjects made 
significantly fewer errors under the noisy than under the quiet conditions, 
thus supporting McBain’s hypothesis. These findings should be borne in mind 
in considering the use of music in industry, to be discussed in the next section. 

But first let us inquire briefly into the findings of industrial studies on 
noise and distraction. Unlike some of the laboratory studies cited by McBain, 
these investigations deal with complex and generally varying noise back- 
grounds. In this respect they would be expected to yield results similar to those 
of McBain’s study. However, the “quiet sessions” in the industrial studies in- 
volve, not a cessation of noise stimuli, but rather an over-all reduction in their 
intensity through the use of soundproofing, sound baffles, or earplugs. Hence 
it can be argued that the “arousal” effect of the varied noise background re- 
mains relatively unaltered. 

The interpretation of results from most industrial studies is questionable 
because of many uncontrolled variables, such as amount of job experience, financial 
incentives, and the ever-present specter of the Hawthorne effect (3; 
6; 47, pp. 200-201). It has been suggested that the ideal experimental design 
for this problem would require that all workers be issued earplugs, some of 
which are defective and hence function like the placebos used in drug studies 
(see 6). Only in this way could we clearly separate the effects of noise reduc- 
tion as such from the attitudinal effects of merely introducing a change de- 
signed to improve employee comfort. 

The next best type of control involves having the same employees working 
under noisy and quiet conditions in a balanced or random sequence. Under 
these circumstances, improvement attributable to morale effects would tend to 
be more general, while that attributable directly to noise reduction would tend 
to be limited to the quiet periods. Such an investigation was conducted with 
a group of workers tending machines that insert the perforations in motion-picture 
film (6). The operators thread the film on one machine while the 
film is running through another machine. Payment is based on rate of work 
as well as film breakage and other operational breakdowns. The task is described 
as being sensitive to momentary lapses of attention and hence well 
suited to the investigation of distraction. All operators normally shifted from 
room to room in a six-week cycle. Some of these rooms remained at their 
original noise levels, while others were treated for noise reduction by placing 
absorbent materials on walls and ceilings and absorbent baffles between the 
rows of machines. The sound level in the treated rooms was reduced from 
about 99 db to about 90 db. 
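
To give a rough sense of what a reduction from about 99 db to about 90 db amounts to physically, the short Python sketch below converts a difference in sound level into ratios of intensity and pressure. It assumes the usual definition of the decibel (ten times the common logarithm of an intensity ratio); the function names are illustrative only and do not come from the study.

```python
def intensity_ratio(db_before, db_after):
    """Ratio of sound intensities for a given drop in level,
    assuming level = 10 * log10(intensity / reference)."""
    return 10 ** ((db_before - db_after) / 10)


def pressure_ratio(db_before, db_after):
    """Ratio of sound pressures for the same drop in level
    (intensity varies as the square of pressure)."""
    return 10 ** ((db_before - db_after) / 20)


ratio_i = intensity_ratio(99, 90)   # roughly 8
ratio_p = pressure_ratio(99, 90)    # roughly 2.8
```

On this definition, the treated rooms carried roughly one-eighth of the original sound intensity, which is a substantial physical change even though the rooms were by no means quiet.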

Performance data were gathered for several six-week periods distributed over 
two years. Following noise reduction, output rose and absenteeism declined 
throughout, with no significant differences between treated and untreated 
areas. Such changes can thus be attributed to improved morale and other gen- 
eral factors. Errors, on the other hand, declined significantly more in treated 
than in untreated areas. These findings are consistent with those of laboratory 
studies in that they indicate fewer momentary lapses of attention when en- 
vironmental stress is reduced. 


MUSIC IN INDUSTRY 


Industrial music had its origins in early work songs, which served chiefly to 
pace the rhythm of motor activities but undoubtedly helped also to relieve 
monotony and improve morale. Although music is now used widely in American 
industry, there are few controlled studies of its effects and results are inconclusive 
(78). Investigations of simple assembly operations and other repetitive 
jobs have reported slight increases in output, reduction in scrappage rate, 
and no change in accident frequency following the introduction of music during 
work (37, 40, 70, 89). 

On the other hand, a well-controlled study of a somewhat more complex in- 
dustrial job yielded negative results (53). The subjects were 142 women em- 
ployed in the manufacturing of rugs. Their particular job required a high 
level of mental and manipulative skill, good visual memory and color discrim- 
ination, and the ability to attend to a variety of job demands. Two to four 
years of experience are generally required to become a skilled operator. During 
each week of the five-week experimental period, music was played for 80 min- 
utes on each of four days and not played on one day. The programs on the four 
days were the same with respect to type of music but differed in the time 
of day when music was played. These programs as well as the no-music day 
were rotated during the five-week period. Output records for music and no- 
music days showed no significant changes that could be attributed to music. 
As an explanation of these negative findings, the authors suggested that over 
the long periods of prior job experience these workers had developed stable 
patterns of work and work tempo that remained uninfluenced by relatively 
mild changes in the working environment. 

Surveys of employee attitudes, however, have consistently revealed pre- 
dominantly favorable reactions to the introduction of music during work. 
When the workers in the above study were asked, “Do you want us to con- 
tinue playing music in this department?” 84.5 per cent answered affirmatively, 
14.5 per cent said it made no difference, and only 1 per cent answered in the 
negative (53). Among the most frequently cited effects of music on work were 
that it reduces monotony, makes time pass more quickly, makes work easier, 
and gives you a lift. It is also interesting to note that 59 per cent of the work- 
ers reported that they got more work done with music, despite the fact that 
objective output records showed that they did not. 

It is entirely possible that, at least in some jobs, music serves to improve 
attitudes and job satisfaction without appreciably affecting output. Such a 
finding would be quite consistent with the results of research on employee 
attitudes discussed in Chapter 6. 

At this point it may be of interest for the reader to turn back to the four 
opinions cited on page 211 and reexamine them in the light of experimental 
findings on noise and music. 


SUMMARY 


Research on the working environment has been concerned with the effects 
of such factors as illumination, ventilation, noise, and music upon job perform- 
ance and employee well-being. Discrepant findings and controversy are com- 
mon in this area because of the use of different criteria of efficiency as well as 
inadequate experimental designs. With regard to illumination, research has 
concentrated on the establishment of minimum standards of intensity for dif- 
ferent types of work, determination of the optimum distribution of light to 
minimize visual fatigue, and a comparison of vision under different colors of 
light. 

The primary object of ventilation is to enable the body to give off excess 
heat through evaporation, radiation, and convection. Industrial studies have 
shown the relation of ventilating conditions to quantity and quality of produc- 
tion, accident rate, and absenteeism. Suggestive results on the behavioral 
effects of weather, seasons, and climate were obtained in a number of early 
studies. In recent years, there has been a revival of interest in these problems. 
Research is also being conducted on the psychological consequences of ex- 
posure to extreme atmospheric conditions, among which oxygen deprivation 
has received the greatest attention from psychologists. 

Chief among the special psychological problems of space travel are those 
arising from extreme and stressful environmental conditions. The physical con- 
ditions investigated in this connection include weightlessness, rotation, high 
accelerative forces, and the conflicting sensory cues leading to spatial dis- 
orientation. A major source of psychological stress is isolation, comprising the 
factors of confinement, detachment, and sensory deprivation. 

Noise has been investigated as a cause of temporary or permanent hearing 
impairment, as a source of interference with auditory communication, and as 
a major form of distraction. The effects of noise distraction on behavior are 
complex. Their proper understanding requires data, not only on output, but 
also on energy expenditure, adaptation, and emotional response to the source 
of distraction. The results depend upon the interaction of various characteristics 
of the distracting stimuli, the task, and the individual. With regard to 
the use of music in industry, the few well-controlled studies of its effects suggest 
that it tends to improve output in simple routine jobs but may have little 
or no effect on more complex operations performed by experienced workers. 
Employee opinions, however, are predominantly in favor of music during 
work. 


In current usage, “engineering psychology” is a rather flexible term. This 
flexibility also characterizes its many synonyms, such as “human engineering,” 
“human-factors research,” and “ergonomics.” All these terms are sometimes 
used in a broad sense to cover practically every topic in industrial psychology, 
including selection, training, motivation, work methods, working environment, 
and equipment design. A more common usage restricts the coverage to the 
last three topics, as has been done in this book. A still narrower usage identi- 
fies engineering psychology with equipment design. The present chapter thus 
deals with engineering psychology in this last and most restricted sense. 

From another angle, equipment design extends beyond industrial psychology 
into some of the fields covered in other parts of this book. The techniques of 
engineering psychology are now being applied more and more to the designing 
of consumer goods. Psychologists are contributing, not only to the designing 
of industrial machinery, military equipment, and space vehicles, but also to 
the designing of such products as kitchen stoves, radios, and telephone dials. 
The latter type of research will be considered in Chapter 10, in connection 
with consumer psychology. Still another application is provided by improve- 
ments in highway design, road signs, traffic lights, and other matters that 
concern the general motorist, which were discussed in Chapter 7. The basic 
methodology and principles considered in the present chapter are equally 
relevant to these other applications. 


HUMAN FACTORS IN EQUIPMENT DESIGN 


Although engineering psychology has only recently attained the status of a 
fully developed field of applied psychology, its antecedents can be traced to 
the earliest beginnings of industrial psychology. As early as 1898, Frederick 
W. Taylor, whose time study was discussed in Chapter 7, applied systematic 
observation of worker performance to the designing of shovels (7, vol. II, ch. 
5). Taylor found that, owing to the haphazard selection of shovel shapes and 
sizes, shovelers in a steel plant were lifting loads varying from 3½ to 38 
pounds. Through empirical tryout of different loads, he ascertained that 21½ 
pounds constituted an optimal load for men working regularly as shovelers. 
When either heavier or lighter loads were tried, total daily output declined. 
Accordingly, he designed different sizes of shovels for different materials, so as 
to yield approximately this load. Thus a small spade-shovel was provided for 
handling heavy iron ore, and a large scoop for shoveling ashes. 

Frank B. Gilbreth, of motion-study fame, also devoted considerable effort to 
designing special chairs for different types of work, in order to minimize strain 
and fatigue (27). One of these was a high chair with a footrest, to be used 
with a tall desk or worktable. With such an arrangement, the operator could 
work equally well while either sitting or standing and could thus shift body 
posture to relieve muscular strain. Among his classic "rules" for motion econ- 
omy, Gilbreth included several dealing with the arrangement of the work- 
place, the position of tools, and the design of equipment. His writings include 
many observations and recommendations on all these ways of improving 
worker efficiency (e.g., 26, ch. 3). 

Two investigations (38, 48) conducted in Germany in the 1920s on the rela- 
tive effectiveness of different sizes and shapes of handles for cranks and screw- 
drivers anticipated some of the later research on tool designing. During the 
same decade, the Great Britain Industrial Fatigue Research Board was con- 
cerned with ways of altering existing machines to facilitate their use by the 
human operator (39, 63). In one survey sponsored by this board, the focus 
of interest was described as “a large class of machines requiring frequent 
manipulation of a release or control or some other manual operation in which 
. . . the worker and the machine form a single system” (39, p. iii). This 
statement has a remarkably modern ring to it and could have been taken from 
a current publication on engineering psychology. The survey led to recommended 
improvements in several types of laundry and leatherworking machines, 
as well as in certain machines used in the manufacture of textiles, 
shoes, and brushes, in sheet-metal work, and in tobacco processing. 

As a final point of historical interest, it might be noted that, in a report on 
psychologists’ activities during World War I, Raymond Dodge referred to “the 
new problems of human engineering which modern warfare occasioned” (see 
54). Dodge’s own contributions during World War I include a pioneer attempt 
to design a wearable gas mask, as well as the development of selection tests 
and training devices. 

It was during World War II, however, that engineering psychology rose 
into prominence as a separate discipline. The immediate impetus for its emer- 
gence came from the increasing complexity of the machines that men were 
called upon to operate. High-speed aircraft, rockets, and radar were among the 
many new instruments whose operation taxed human capacities to unprece- 
dented degrees. It soon became apparent that selection and training were in- 
adequate to provide a sufficient number of operators who could function at 
the required level of proficiency. The solution was to redesign the equipment 
so as to make the operator’s job simpler, less confusing, less demanding, or in 
other ways more manageable. This might be accomplished by improving the 
displays (such as dials) through which information is received, by making the 
controls easier to operate, or by having the machine take over some of the 
complicated intervening processes formerly carried out by the human operator. 

The same techniques of equipment design, of course, can be applied to im- 
proving productivity in any type of job, regardless of its initial complexity or 
difficulty. The improvement that can be effected by such procedures is usually 
far greater than that ordinarily obtained through the selection or training of 
operators or the introduction of more efficient work methods (57). Moreover, 
equipment redesign permits a fuller utilization of available manpower by reducing 
the ability requirements of the job or by making provision for special 
disabilities. For instance, some research has been done to identify ways in 
which industrial equipment might be modified for more effective use by older 
workers (31). 

Present-day engineering psychology differs in a number of ways from earlier 
work in the design of equipment. The most conspicuous difference is in its 
magnitude. Since its formal beginnings in World War II, engineering psy- 
chology has been expanding at a prodigious rate. Its bibliographies run into 
many thousands of titles (6, 13, 42, 45, 47). Much of this research is con- 
ducted under military sponsorship, but industrial participation is increasing 
steadily. In contrast to early, scattered efforts, modern engineering psychology 
approaches the problems of equipment design more systematically. This can 
be illustrated by the many interrelated studies exploring the variables affecting 
tracking performance, to be discussed in a later section. Another example is 
provided by the publication of several handbooks summarizing normative data 
on sensory, motor, and other relevant human capacities for use by engineers in 
the design of equipment (43, 47, 59, 62). 

Another differentiating characteristic of engineering psychology is its strong 
research orientation. Earlier work on equipment design was often based on 
superficial observation of workers followed by "commonsense" recommenda- 
tions. Although the more glaring equipment deficiencies could be corrected 
in this fashion, the “commonsense” approach is quite limited and may at times 
lead to false conclusions (4, pp. 5-10). Today the relative effectiveness of dif- 
ferent types of equipment is investigated empirically under controlled condi- 
tions. 

A major feature of current engineering psychology is its comprehensive ap- 
proach. Beginning as "knob-and-dial" research, which dealt with rather spe- 
cialized equipment problems, engineering psychology is becoming increasingly 
concerned with man-machine systems (19, 55, 57). The latter involve a com- 
posite analysis of all that is done by the machines and by the operator or team 
of operators required to carry out a complex task. It is in such systems research 
that the most spectacular progress is being made today. Moreover, it is chiefly 
through this research that the applied field of engineering psychology is in 
turn beginning to make contributions to basic psychological knowledge. 


SPATIAL LAYOUT OF WORK 


Among the earliest recommendations of time-and-motion analysts were those 
pertaining to the spatial arrangement of the workplace, tools, and supplies. 
Materials should be arranged in the order in which they are to be used, so 
that the worker need not cross and recross his path of motion. Moreover, hav- 
ing a regular place for each article eliminates search. Further savings are 
effected by pre-positioning tools in holders in such a way that they are grasped 
in the position in which they will be used. Holders for pens provide a simple 
and familiar example of this principle. Several holders have been devised for 
pre-positioning screwdrivers and other small tools (2, pp. 288-290). 

The application of knowledge regarding human dimensions and movement 
characteristics to the designing of work layout is illustrated by the semicircular 
work space. In an assembly job, for example, the bins containing the parts to 
be assembled should be arranged, not in a straight line, but in a semicircle 
in front of the worker, as illustrated in Figure 58. Such an arrangement 
eliminates the excessive reaching required for the end bins in a linear arrangement. 
Figure 58 also shows the outer dimensions of the so-called “normal” and 
“maximum” working areas. The former is determined by sweeping the forearm 
across the table, with the upper arm hanging against the body and with 
the elbow as a pivot. The latter is found in the same fashion, except that the 
whole arm is now rotated, with the shoulder as a pivot. The area where the 
semicircles defined by the right and left hands overlap represents a zone 
where two-handed work can be performed most conveniently. 


Fig. 58 Arrangement of bins for an assembly operation. Normal and 
maximal semicircular areas have been superimposed. (From Barnes, 
2, p. 250.) 
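
The geometry of the normal and maximum working areas can be sketched in Python as follows. This is a deliberately idealized model and not a procedure taken from Barnes: it treats each area as a true semicircle on the table top, and the reach lengths and pivot positions are hypothetical round figures chosen only for illustration.

```python
import math

# All dimensions are in inches and are hypothetical illustrative values.
FOREARM_REACH = 14.0    # sweep radius with the elbow as pivot
FULL_ARM_REACH = 26.0   # sweep radius with the shoulder as pivot
# Position of each arm's pivot along the table edge, seen from above.
# With the upper arm hanging against the body, elbow and shoulder are
# assumed to lie over the same point of the edge.
PIVOT_X = {"left": -8.0, "right": 8.0}


def classify(x, y, hand):
    """Classify a point on the table (y = distance in from the edge)
    as lying in the 'normal' area, the 'maximum' area, or out of reach."""
    if y < 0:
        raise ValueError("point lies behind the table edge")
    distance = math.hypot(x - PIVOT_X[hand], y)
    if distance <= FOREARM_REACH:
        return "normal"
    if distance <= FULL_ARM_REACH:
        return "maximum"
    return "out of reach"


def two_handed_normal(x, y):
    """True where the normal areas of the two hands overlap."""
    return all(classify(x, y, hand) == "normal" for hand in PIVOT_X)
```

With these assumed dimensions, a bin placed at `(0, 8)` falls within the overlap of the two normal areas, whereas one at `(12, 8)` is within normal reach of the right hand only.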

The semicircular work space is appropriate for any situation where the 
operator must reach for a number of items in a repetitive sequence. The 
collating of papers and the stuffing of envelopes by a clerk represent familiar 
examples. It might appear that the savings in time and energy effected by the 
semicircular work space are trivial. This is true if we consider the collating of 
a single set of papers or the assembly of a single article. But in a repetitive job 
the savings mount up to impressive figures. In one large radio-manufacturing 
company, for instance, it was estimated that, by shortening the reach to each 
supply bin by 6 inches, the total saving in time was 34, hours a year and 
the saving in distance traveled by the operators’ hands was 98,500 miles a 
year! (See 2, p. 254.) 

For most purposes, the semicircular pattern provides a sufficiently accurate 
guide for defining the most effective work space. Human movements, how- 
ever, rarely fit into such simple geometric forms. A detailed empirical study 
recently conducted under Navy auspices demonstrated that the normal work- 
ing area is more limited in extent and is defined by a more complex figure, 
known in mathematics as a “prolate epicycloid” (53). This figure—which 
does not look quite so complicated as it sounds—is shown in heavy solid line 
in Figure 59. The traditional semicircular areas corresponding to normal and 
maximum working space have been superimposed for comparative purposes. 
The deviation from a semicircle results chiefly from the fact that, when the 
forearm moves outward, the elbow does not remain in a fixed position but 
moves out and away from the body. Consequently, the figure is somewhat 
flattened, and the right and left corners are chopped off. This type of curve 
is useful not only in designing a flat working surface but also as a guide for 
the construction of control panels and other vertical working areas. For the 
latter purpose, the vertical surface is erected perpendicularly along the outer 
contour of the figure, which is first flattened out at the center as shown by 
the dotted line in Figure 59. 

Another illustration of research on work layout is provided by the type- 
writer keyboard (8, 9, 34). The arrangement of characters on the standard 
keyboard makes insufficient use of the more efficient fingers, while overloading 
the less efficient. Empirical surveys provide a count of the frequency of occur- 
rence of the 26 letters of the alphabet in the most commonly used English 
words, as well as the frequency of punctuation marks, capitals, and other char- 
acters in large samples of business letters. The relative ability of the eight 
fingers used for typewriting has been investigated by means of tapping tests 
and by analyses of the number of errors made by each finger (9, 34). 


Fig. 59 Modified normal working area, with traditional semicircular 
areas superimposed. Dimensions shown are in inches. (Adapted from 
Squires, 53, p. 2.) 


On the basis of all this information, the "simplified" keyboard reproduced in 
the right-hand portion of Figure 60 was developed. The load carried by each 
finger of the right and left hands is also shown in Figure 60. This distribution 
of load can be contrasted with that obtained with the standard keyboard, 
given in the left half of Figure 60. It will be seen that with the standard keyboard 
the less efficient left hand carries more than half of the typing load, insufficient 
use is made of the relatively efficient right middle finger, and the 
second, or "home," row is used less frequently than the less accessible third 
row. Research conducted with high school and college students and with 
schoolchildren over a period of several years indicated that the simplified key- 
board requires less learning time, increases typing speed, decreases errors, and 
produces less fatigue than the standard keyboard (8). A major obstacle in the 
way of introducing an innovation of this sort, of course, is the resistance of 
typists who have become expert with the old keyboard. 
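
The kind of load tally described above can be illustrated with a short Python sketch. It assumes that the "standard" keyboard is the familiar QWERTY arrangement with the conventional touch-typing division of keys between the hands, and it counts letters only. The sample sentence is merely a convenient test string, not a representative sample of English, so the resulting figures illustrate the method rather than reproduce the published load distribution.

```python
from collections import Counter

# Letter rows of the QWERTY keyboard (assumed to be the "standard" one).
ROWS = {"top": "qwertyuiop", "home": "asdfghjkl", "bottom": "zxcvbnm"}
# Letters struck by the left hand under conventional touch typing.
LEFT_HAND = set("qwertasdfgzxcvb")


def keyboard_load(text):
    """Count how many letters of `text` fall on each row and each hand."""
    letters = [c for c in text.lower() if c.isascii() and c.isalpha()]
    row_of = {ch: row for row, keys in ROWS.items() for ch in keys}
    rows = Counter(row_of[c] for c in letters)
    left = sum(c in LEFT_HAND for c in letters)
    return {"rows": dict(rows), "left": left, "right": len(letters) - left}


load = keyboard_load("the quick brown fox jumps over the lazy dog")
```

Even on this small sample the pattern described in the text appears: the top row is struck more often than the home row, and the left hand carries more than half of the letters. Applying the same tally to a large body of ordinary correspondence, and to an alternative arrangement of the keys, would be the natural way to compare two keyboards.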


DISPLAY PROBLEMS 


In the terminology of engineering psychology, a display is a device used to 
present information indirectly by symbolic or pictorial means. Examples of 
common displays include speedometers, thermometers, pressure gauges, gas 
meters, altimeters, doorbells, and fire gongs. With advancing automation, 
there is an increase in the number and complexity of displays to which the 
human operator must successfully respond. Research has shown that improve- 
ments in the design of displays can lead to substantial savings in time and 
energy and a marked reduction in accident risk. 

Visual Displays. By far the most common types of displays are visual. 
Such displays may become very complex, as illustrated by the instrument 
panels of modern aircraft. Engineering psychologists have studied many aspects 
of visual displays, such as dial size and shape, length of scale units, number 
and spacing of scale markers, direction of pointer movements, size and style of 
letters and numerals, and position of displays in relation to the normal posi- 
tion of the operator's head and eyes (see 43, chs. 4 and 14). Some experi- 
mental findings indicate an interaction of different variables in the subject's 
performance. For example, certain dial characteristics are more effective in 
situations calling for rapid reading, while others are more effective when the 
operator can take as much time as he needs for careful reading. Thus, in rapid 
reading, greater accuracy is usually achieved with a dial having fewer scale 
markers and hence requiring some interpolation. On the other hand, a dial 
with individual markers corresponding to each scale unit yields better results 
when adequate reading time is available (43, ch. 4). 

Several experiments have been concerned with dial design. One experiment 
with 60 male college students compared the five types of dials reproduced in 
Figure 61, including vertical, horizontal, semicircular, round, and open-win- 
dow (50). In the first four dial types, the pointer moves along the dial scale; 
in the open-window types, the pointer remains fixed, and the dial scale moves. 
Seventeen settings were presented with each dial, each setting falling on one of 
the scale markers (i.e., either a whole number or a half number). Exposure 
time for each setting was 0.12 second. An error was scored when the subject 
either gave an incorrect reading or was unable to give a reading during the 
brief exposure. As can be seen in Figure 61, the smallest percentage of errors 
occurred with the open-window design, the largest with the vertical 
dial (35.5). 


Fig. 61 Percentage of errors in reading five types of dials. 
(Adapted from Sleight, 50, pp. 177, 181.) 


One factor determining the relative accuracy of reading different types of 
dials seems to be the area that the observer must scan in order to make a 
reading. In the above experiment, the greater the area, the higher was the 
error frequency. The open-window type provides the most compact design, 
the circular the next most compact. The semicircular and horizontal scales 
require the eye to cover increasingly more ground. It is also relevant to note 
in this connection that with semicircular and horizontal scales errors were 
more common at the extreme positions. The difference between horizontal and 
vertical scales, on the other hand, is undoubtedly related to the fact that 
horizontal eye movements are easier and faster than vertical eye movements. 
Customary reading habits, involving the horizontal scanning of the page, 
would also tend to favor the horizontal type of scale. It is noteworthy that the 
advantage of horizontal as compared with vertical scales persists when longer 
exposure times are used (29). 

Another variable that affects the results of dial-reading experiments is the 
type of reading required. The above experiment called for quantitative read- 
ing, in which the actual numerical value must be determined. In such dial 
reading, the open-window and moving-tape design has generally yielded the 
greatest speed and accuracy. In qualitative or check-reading, on the other 
hand, the purpose may be simply to determine whether the indicator is within 
a normal or safe range or whether the equipment is or is not functioning 
properly. Moreover, qualitative reading may be concerned with the direction 
and rate of movement of an indicator, as well as with its position. For this 
type of reading, the dial and moving-pointer display is superior to the open- 
window type (11). Such findings again illustrate the importance of consider- 
ing the interaction of different variables in drawing conclusions from applied 
research. It should also be noted that engineering considerations may make 
the use of the psychologically optimal display impracticable in certain situa- 
tions. Under such circumstances the compromise solution must utilize both 
physical and psychological facts in the best possible way. 

One experiment on check-readings was concerned with the simultaneous 
monitoring of groups of dials (49). The subject's task was to identify in each 
panel the one dial whose pointer was not in the normal position. As illustrated 
in Figure 62, two arrangements were used for the normal pointer positions. In 
one arrangement, the normal position varied from dial to dial, being defined by 
the location of a small red arc on the circumference of the dial. In the other, 
the normal position was the same in all dials, the pointers being in a horizontal 
“nine o'clock” position. Various numbers of dials up to 45 were employed 
with each arrangement. 


Fig. 62 Speed of check-reading dials with normal pointer positions 
aligned and not aligned in panel. (Data from Senders, 49, p. 21; 
chart prepared by Aero Medical Laboratory, United States Air Force.) 


Reference to the graph in Figure 62 shows that the time required by the 
subjects to identify the deviant dial increases with the number of dials in the 
panel, as would be expected. The increase is much steeper, however, when 
the normal pointer positions are not aligned than when they are aligned on 
the different dials. In 0.5 second, for instance, subjects could check-read as 
many as 32 aligned dials, but only 4 nonaligned dials. Even the largest number 
of dials employed in the experiment, 45, could be check-read in less than 1 
second when pointers were uniformly aligned. Increasing the number of dials 
makes little practical difference under these conditions. 

A final example of research on visual displays is provided by studies on 
airplane-attitude indicators. The “attitude” of an airplane refers to its relation- 
ship to the earth, that is, whether the airplane is climbing, diving, or banking 
and the amount of its tilt from the horizontal. Two ways of showing this 
relationship on an indicator are illustrated in Figure 63. In the left-hand 
indicator, the position of the airplane is fixed, while the horizon turns; in the 
right-hand indicator, the horizon is fixed, while the airplane turns. Both of the 
indicators in Figure 63 show a left roll of the airplane. 

The left-hand indicator corresponds to what one would see if he looked out 
the window, since when the airplane rolls the horizon appears to turn. This is 
also the way attitude indicators were originally designed. Nevertheless, it was 
the right-hand indicator with fixed horizon and turning airplane that gave the 
more accurate results and required less training time (3, 20, 21, 41). It was 
this arrangement, too, that was preferred by most subjects, including experi- 
enced pilots who had been using the old type of indicator. Apparently in this 
situation intellectual realism, based upon the knowledge that the airplane tilts 
and the horizon remains fixed, is more important than visual realism. 


Fig. 63 Two types of airplane-attitude indicators. (From McCormick, 
43, p. 388.) 


Auditory Displays. Although relatively little use has been made of auditory 
displays, their potentialities are being explored in more and more situations. 
Because of their attention-compelling nature, auditory signals are especially 
suitable for reporting emergencies. While the observer might be looking 
elsewhere or might close his eyes, his ears are always "open" to stimuli. Au- 
ditory displays are also useful in situations where heavy demands are made 
upon visual attention. The substitution of auditory displays for one or more 
of the visual displays in these situations would remove some of the overload 
from the visual modality. Such is the case, for instance, in "blind flying," when 
the number of instruments to be monitored is very large. 

As an aid in blind flying, or instrument flying, the military services have 
experimented with a system of auditory signals known as "Flybar" (Flying by 
auditory reference). In this system, sound signals are employed to provide 
information regarding airspeed, turn movements, and banking movements. 
Although Flybar has not been used operationally, it has yielded some research 
data having broad implications for the use of auditory displays in general (18). 
For example, auditory signals should be as realistic as possible and should tie 
in with habitual associations. Thus airspeed may be indicated by varying the 
rate of a continuous series of “putt-putt-putts,” commonly associated with the 
sound of a motor. It was also found that a three-in-one signal, in which different 
characteristics of the sound stand for different types of information, was 
better than three separate signals. In the latter case, one signal was likely to 
catch and hold the attention for a time, to the neglect of the others. Combin- 
ing the information into a single signal permitted the operator to attend to all 
three kinds of data. 

Another example of auditory displays is provided by the radio-range signals 
employed in aerial navigation. These signals are produced by using two di- 
rectional radio beams at right angles to each other. One beam transmits the 
letter A by Morse code (dot-dash); the other beam transmits the letter N 
(dash-dot). The signals are so generated that if the aircraft is “on the beam,” 
or flying just between the two crossed beams, a steady tone results. If the air- 
plane veers too far to the left or right, the A or N will be heard instead. Still 
another illustration of auditory displays is to be found in sonar, a technique 
for identifying underwater objects and for determining the direction of their 
movement. Pitch discrimination is crucial in this operation. 
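
The way the A and N signals merge into a steady tone can be illustrated with a small sketch. The timing used here (a dot of one unit, a dash of three units, a one-unit gap within the letter, a three-unit gap before the letter repeats) and the five-unit delay of the N beam are assumptions chosen for illustration; the names `MORSE_A`, `MORSE_N`, `rotate`, and `heard` are likewise hypothetical. Under these assumptions the N pattern sounds exactly during the silences of the A pattern.

```python
# Keying patterns in equal time units (1 = tone on, 0 = silence).
# Assumed timing: dot = 1 unit, dash = 3 units, 1-unit gap inside the
# letter, 3-unit gap before the letter repeats.
MORSE_A = [1, 0, 1, 1, 1, 0, 0, 0]   # dot-dash
MORSE_N = [1, 1, 1, 0, 1, 0, 0, 0]   # dash-dot


def rotate(pattern, delay):
    """Return the repeating pattern delayed by `delay` time units."""
    delay %= len(pattern)
    if delay == 0:
        return list(pattern)
    return pattern[-delay:] + pattern[:-delay]


def heard(a_strength, n_strength, n_delay=5):
    """Loudness in each time unit when both beams are received together.

    `n_delay` is an assumed offset that makes N fill the silences of A.
    """
    n_shifted = rotate(MORSE_N, n_delay)
    return [a_strength * a + n_strength * n
            for a, n in zip(MORSE_A, n_shifted)]


on_beam = heard(1, 1)    # both beams equally strong: constant loudness
off_beam = heard(2, 1)   # A beam stronger: the dot-dash rhythm stands out
```

With equal strengths every time unit has the same loudness, which corresponds to the steady tone; when one beam is stronger, its letter is heard against a weaker background.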

To all these examples of auditory displays we must of course add the very 
important medium of speech communication. Some reference to research on 
speech intelligibility was made in Chapter 8. In that chapter we also con- 
sidered investigations on the effects of noise on all forms of auditory communi- 
cation, an area of research that also constitutes a major part of the present 
topic. 

Other Sense Modalities. In recent years, some exploratory research has 
been conducted on the communication possibilities offered by still other sense 
modalities, notably the kinesthetic and the tactual. Thus the “feel,” or amount 
of resistance encountered in operating a stick, lever, or other control, may 
convey relevant information to the operator. The distance over which a con- 
trol is moved may also be perceived through kinesthetic cues. And different 
controls may be identified tactually by the shape of their knobs. Some illustra- 
tive research on the shape coding of knobs will be cited in the following 
section, dealing with control problems. The skin senses have also been 
investigated as a more general medium of communication, through the use of 
either electrical (28, 33) or mechanical (24, 25) stimulation. 

A dramatic demonstration of “tactile literacy" has been provided by Geldard 
and his students (24, 25). By applying a vibratory stimulus to any one of five 
chosen areas on the chest and employing three durations (0.1, 0.3, and 0.5 
second) and three intensity levels, Geldard was able to produce an "alphabet" 
of 45 discriminable tactual sensations (5 × 3 × 3). Years of psychophysical 
research on tactual sensitivity preceded the development of this system. For 
purposes of communication, the 26 letters, 10 digits, and a few common short 
words were assigned to the elements of this “vibratese language.” The appa- 
ratus for sending messages is operated by means of a typewriter. It was found 
that subjects can learn this alphabet about as quickly as they can learn Morse 
code. The receiving speed attained, moreover, was substantially higher than 
that of experienced Morse code operators. 
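
The arithmetic behind the 45-element alphabet can be made concrete with a brief sketch. The three durations are those given above; the intensity labels, the numbering of the chest locations, and the particular assignment of letters and digits to signals are assumptions made only for illustration and are not the assignment used in the original system.

```python
from itertools import product
import string

LOCATIONS = (1, 2, 3, 4, 5)                  # five chest areas (numbering assumed)
DURATIONS = (0.1, 0.3, 0.5)                  # seconds, as given in the text
INTENSITIES = ("weak", "medium", "strong")   # labels assumed

# Every combination of location, duration, and intensity is one signal.
SIGNALS = list(product(LOCATIONS, DURATIONS, INTENSITIES))

# 26 letters and 10 digits; the remaining signals are free for short words.
SYMBOLS = list(string.ascii_uppercase) + list(string.digits)
CODEBOOK = dict(zip(SYMBOLS, SIGNALS))       # hypothetical assignment
SPARE_SIGNALS = SIGNALS[len(SYMBOLS):]


def encode(message):
    """Translate a message into (location, duration, intensity) triples."""
    return [CODEBOOK[ch] for ch in message.upper() if ch in CODEBOOK]


print(len(SIGNALS), len(SPARE_SIGNALS))
```

The count of signals is 5 × 3 × 3 = 45, of which 36 are taken up by letters and digits, leaving 9 for common short words.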


CONTROL PROBLEMS 


A control is a device for utilizing human effort in activating or directing a 
machine. Controls include a wide variety of handles, levers, handwheels, 
cranks, foot pedals, and the like. The steering wheel of an automobile is a 
familiar example. In designing controls, the engineering psychologist draws 
upon much of the basic research on human motor activities, such as the studies 
cited in Chapter 7. He might, for example, take into account the relative 
accuracy, speed, or strength of hand movements in different directions, the 
accuracy of blind positioning movements at various points, or the conditions 
that affect the speed of serial movements. In addition, a considerable body of 
applied research has dealt more specifically with the operation of controls. 

Tracking. A task that has been extensively studied by engineering 
psychologists is that of tracking a moving target. Tracking is basic to several 
military operations, such as those involved in radar, gunnery, and fire control. 
It is also required in certain industrial tasks and other civilian activities, such 
as driving a car. When the driver turns the steering wheel to keep the car 
moving within the proper lane, he is carrying out a tracking operation. 

In tracking experiments, the subject must make continuous adjustments 
with a crank, wheel, or other control device in order to maintain alignment 
between a “target” and a “cursor,” or indicator (Figure 64). Tracking may be 
of the pursuit or the compensatory variety. In pursuit tracking, the target 
moves, and the subject must follow this movement with the cursor, which he 
controls. In compensatory tracking, there is a fixed center reference mark, and 
the operation of the control counteracts the target movement so as to keep the 
target aligned with this fixed mark. In compensatory tracking the subject does 
not directly observe either the experimentally controlled stimulus movement or 
the response movement, which he himself controls. What he does observe is 
the error, or difference, between the two movements. Under most conditions, 
compensatory tracking is more difficult than pursuit tracking. 
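
The difference between the two kinds of display can be sketched in a few lines. The function names and the use of a single number for position are assumptions made for illustration; the point is only that the pursuit display presents both movements, whereas the compensatory display presents their difference.

```python
def pursuit_display(target, cursor):
    """Pursuit: the subject sees the target and his own cursor separately."""
    return {"target": target, "cursor": cursor}


def compensatory_display(target, cursor):
    """Compensatory: only the error relative to a fixed center mark is shown."""
    return {"center_mark": 0, "target": target - cursor}


# Two different situations that share the same error of 2 units.
first = (5, 3)      # (target position, control output)
second = (12, 10)

print(pursuit_display(*first), pursuit_display(*second))
print(compensatory_display(*first), compensatory_display(*second))
```

The two situations look different on the pursuit display but identical on the compensatory display, which is one way of seeing why the subject has less information to work with in compensatory tracking.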

In several hundred experiments, engineering psychologists have investigated 
the effects of a variety of task variables upon tracking performance (see 1; 
40; 43, ch. 11). Examples of these variables include characteristics of the 
target, such as its rate of movement and size; characteristics of the control, 
such as its position, size, plane of rotation, rate of motion, friction, and inertia; 
ratio between rate of motion of control and of cursor; and type of control 
exerted, namely, position, velocity, or acceleration. Some studies have employed 
two-handed tracking, in which one hand controls right-left movement of the 
cursor and the other hand controls front-back or up-down movement. 


Fig. 64 A typical tracking task. 


The possibility of carrying out two separate tracking tasks with the two 
hands has also been explored (see 1). Such experiments may use two visual 
sources or a visual and an auditory source. This line of research is related to 
the theoretical question of whether man is a “one-channel system" or whether 
he can actually handle two simultaneous operations. Available data on exactly 
what occurs when the subject carries out multiple tasks are as yet insufficient 
to answer this question. 

Relatively few tracking experiments have dealt with procedural variables, 
such as amount and distribution of practice and nature of instructions to 
subjects (1). It is important to know how task characteristics interact with such 
subject conditions as fatigue, motivation, and degree of learning. It is also 
important to know more about the subject’s behavior in carrying out a tracking 
operation. Some research suggests, for example, that the subject's motor re- 
sponses in tracking are essentially intermittent rather than continuous, even 
when they may appear to be proceeding smoothly and without interruption. 
Other research has been concerned with the subject's ability to anticipate 
target movements and the extent to which he may learn to do so with practice 
(1). 

Display-Control Compatibility. The spatial relationships between display 
and control affect the speed and accuracy with which the control is operated. 
Isomorphism, or similarity in pattern of displays and controls, facilitates re- 
sponse (15, 16, 22). For example, if the operator must push a button corre- 
sponding to the particular light that flashes on a panel, he can do so faster and 
with fewer errors if lights and buttons follow the same pattern. If the lights 
are arranged in a square, the buttons should also form a square; if the lights 
are in a column, the buttons should also be in a column, and so on. And of 
course, within such a pattern, the light and button that belong together should 
occupy corresponding positions. The greater the display-control compatibility, 
the less "translating," or data processing, will be required of the operator. 

As the correspondence between display and control patterns decreases, not 
only does the difficulty of the task increase, but there are also changes in the 
number and nature of the abilities required by the task. These effects were 
demonstrated in an experiment with the task illustrated in Figure 65 (17). 
When a stimulus light flashed on the vertical display panel, the subject was 
to push the corresponding button on the response panel as quickly as possible. 
The position of the arrow on display and control panels served as a reference 
point. Thus, if the light that flashed on was the third to the left of the arrow, 
the subject was to push the third button to the left of the arrow. The first 
illustration in Figure 65 shows the two arrows in identical positions on both 
panels. Seven other positions were investigated, in which the display panel 
was rotated by different amounts. The 90° position is shown in the right-hand 
half of Figure 65. 

The effect of such rotations on accuracy of performance can be seen in 
Figure 66. It is evident that a 0° display rotation, representing complete 
isomorphism of display and control panels, yields a far greater number of correct 
responses than do any of the other arrangements. The subjects in this study 
had also taken several tests chosen so as to measure a number of abilities 
previously identified through factor analysis. Performance on the easiest task, with 
0° rotation, correlated .47 with perceptual speed and had negligible 
correlations with all the other abilities tested. With a 90° display rotation, 
performance correlated only .25 with perceptual speed, but it correlated .69 with 
spatial orientation. With 180° rotation, the correlation with perceptual speed 
dropped to a negligible .04, but performance now correlated .40 with spatial 
orientation and .40 with response orientation. Changing the relationship 
between display and control thus altered the nature of the task performed by 
the subject. 


Fig. 65 Two combinations used in studying effects of display-control 
relations on task difficulty and on abilities required: 0° display rotation 
(left) and 90° display rotation (right). (From Fleishman, 17, p. 523.) 


Fig. 66 Number of correct responses in relation to display-control 
compatibility. Display rotation of 0° represents complete compatibility. 
(From Fleishman, 17, p. 527.) 


When movement of the control determines the direction of movement of the 
display, as in tracking, certain population stereotypes may affect performance. 
Several investigators have identified a number of prevalent associations be- 
tween control movements and expected display movements (see 12; 43, ch. 
14). Some of these associations are well known. For example, the large ma- 
jority of persons brought up in our mechanized culture find it “natural” to 
turn the steering wheel in a clockwise direction in order to turn the car to the 
right. In other contexts, we expect a clockwise movement of a radio dial to 
increase the volume of sound, but we expect a clockwise movement of a water 
faucet to reduce the flow of water. The latter stereotype also applies to steam 
valves. Controls that violate such stereotypes cause annoyance, delay, and error. 

A typical investigation designed to identify some of these population stereo- 
types was conducted by the Air Force (60). Using panels such as those illus- 
trated in Figure 67, the subject was to “move” a light to the center position 
by rotating the knob. Actually rotation of the knob caused the lights in the 
row to turn on and off in sequence. Unknown to the subject, the equipment 
was so adjusted that the light moved toward the center regardless of the direc- 
tion in which the knob was turned at the start of any one trial. Thus no direc- 
tion was differentially rewarded. Any preferred directional association revealed 
by the data would therefore indicate the subject’s own predisposition. 

The three panels yielding the most clear-cut results are reproduced in Figure 
67. Examination of panels A and B would suggest that clockwise movements 
are associated with “right” and “up,” while counterclockwise movements are 
associated with “left” and “down.” That the relation is not so simple as this, 
however, is indicated by the results obtained with panel C. Here most subjects 
chose counterclockwise for upward and clockwise for downward movements. 
The answer seems to be that the preferred relation is one in which the display 
moves in the same direction as the side of the control nearer to it. When the 
position of the control is reversed, as in panels B and C, the “meaning” of 
clockwise and counterclockwise rotation is correspondingly reversed. 


Fig. 67 Some examples of population stereotypes in display-control 
relationships. (Adapted from Warrick, 60.) 
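
The rule that the display is expected to move in the same direction as the nearer side of the control can be written out as a short sketch. The function name and the way a layout is described (by naming the side of the knob that lies nearest the display) are assumptions introduced for illustration; the only fact used is the geometry of a rotating knob.

```python
# Direction in which each side of a knob travels during clockwise rotation.
CLOCKWISE_SIDE_MOTION = {
    "top": "right",
    "right": "down",
    "bottom": "left",
    "left": "up",
}
OPPOSITE = {"right": "left", "left": "right", "up": "down", "down": "up"}


def expected_display_motion(side_nearest_display, rotation):
    """Predict the display movement most operators would expect.

    side_nearest_display: "top", "right", "bottom", or "left" side of the knob.
    rotation: "clockwise" or "counterclockwise".
    """
    if rotation not in ("clockwise", "counterclockwise"):
        raise ValueError("rotation must be 'clockwise' or 'counterclockwise'")
    motion = CLOCKWISE_SIDE_MOTION[side_nearest_display]
    return motion if rotation == "clockwise" else OPPOSITE[motion]


# Knob below a horizontal row of lights: its top side is nearest the display.
print(expected_display_motion("top", "clockwise"))
# Knob to the left of a vertical column: its right side is nearest the display.
print(expected_display_motion("right", "counterclockwise"))
```

When the knob is moved from one side of a vertical display to the other, the nearest side changes from left to right, and the predicted meaning of clockwise rotation changes from "up" to "down," in agreement with the reversal described above.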

Coding of Controls. The rapid and correct identification of controls is im- 
portant in many situations. Turning on the hot-water faucet when we want 
cold water may be annoying or momentarily painful. Stepping on the ac- 
celerator instead of the brake pedal in a car or reaching for the wrong control 
in an airplane can prove fatal. In an Air Force survey conducted during World 
War II, for example, confusion between landing-gear and flap controls was re- 
ported to have caused 457 aircraft accidents within a 22-month period Ct 
p. 607). 

The identification of controls can be facilitated by coding each control in 
terms of some readily observable characteristic, such as its location, size, shape, 
or color. Thus if the cold-water faucet is always on the right and the hot- 
water faucet on the left, or if the cold-water faucet has a blue knob and the 
hot-water faucet a red knob, confusion will be greatly reduced. Standardization 
of such features is thus one way to facilitate the use of controls. The fact that 
controls are often in different positions in different models of cars, airplanes, 
or other equipment interferes with their effective operation (61). Under these 
conditions, uniform coding of controls with regard to shape, color, or some 
other characteristic will reduce the interference effects when the operator 
changes from one model to another. 

Another factor determining the ease of identifying controls is their discrim- 
inability. Many experiments have been conducted to determine how far apart 
two controls must be, how much they must differ in size, or what shape they 
must have to be differentiated with a minimum of error (see 43, pp. 390-397). 
Special attention has been given to the shapes of handles for levers, 
switches, and adjustment controls of various sorts (30, 35, 36, 61). Shape of 
control handles is of particular interest because of its many usable variations, as 
contrasted, for instance, with the limited number of practicable sizes. Unlike 
colors, moreover, shapes can be identified tactually, when the operator's vision 
is otherwise occupied or when the control must be operated in dim light. 

In one Air Force experiment on the tactual identification of knobs (36), 25 
plastic shapes were mounted on rods that were bolted into a turntable, as 
shown in Figure 68. Each subject, wearing blindfold goggles, was given a test 
knob to explore for one second. The turntable was then rotated to a pre- 
designated point, and the subject felt each knob in turn until he found one 
that he thought was the test knob. The same procedure was repeated with 
each shape as a test knob. The order of test and comparison knobs was varied 
systematically to balance out intraserial effects. Forty Air Force pilots served 
as subjects. 

An analysis of the frequency with which each shape was confused with 
every other shape revealed certain sets of shapes that were most clearly dis- 
tinguishable. Figure 68 shows one set of eight shapes that were never confused 
with each other by any of the subjects. Comparison of results obtained with 
gloves and with bare hands showed no essential differences in the relative 
discriminability of shapes, although more errors were made in general when 
gloves were worn. Similarly, the investigation of horizontally mounted con- 
trols yielded about the same results as those obtained with vertically mounted 
controls, suggesting that mode of mounting or position in which the knob is 
grasped is not a critical factor in its identification. 

A third way to facilitate identification of controls is to utilize meaningful 
associations. This technique is somewhat similar to the use of population 
stereotypes in establishing display-control directional relations. Certain well- 
known color symbols can be relied upon in coding controls. The association of 
red and green lights with "stop" and "go" and the identification of red with 
fire or danger of any sort are familiar examples. Meaningful symbols can also 
be incorporated in shape coding of controls. The Air Force now uses several 
control knobs whose shapes suggest their functions (see 43, p. 392). The 
landing-gear knob, for example, is shaped like a landing wheel and the 
flap-control knob is shaped like a wing. 


Fig. 68 Procedure employed in study of tactual discrimination of shapes 
for use in coding aircraft control knobs. The eight shapes shown below 
the photograph were never confused with each other. (From Jenkins, 
36, figs. 14.1, 14.6.) 


MAN-MACHINE SYSTEMS 


Basic Concepts. If we were to name the one most distinctive contribution 
of modern engineering psychology, it would probably be the concept of 
man-machine systems. A schematic diagram of such a system is given in Figure 
69. Essentially, this approach requires that we consider the man and the 
machine he operates as interacting parts of a single system. Psychologists 
began to use such an approach in their efforts to communicate with engineers. 
It is characteristic of man-machine systems that the functions of the human 
operator are described in engineering terminology. These functions include 
information receiving, data processing, and operation of controls. Inputs, such 
as temperature, amount of gasoline, direction and speed of motion, and the 
like, are translated by the mechanisms into signals, which are transmitted to 
the human operator by the displays. In many situations, of course, the operator 
receives at least some information directly, as from looking at the road when 
driving a car. The operator combines the new information with his stored 
knowledge and thereby decides upon a course of action, such as the operation 
of a particular control. The mechanism activated by the control in turn leads to 
a corresponding output, such as turning the car to the right or increasing the 
altitude of the plane. 


Fig. 69 Schematic diagram of a man-machine system. (Adapted 
from Taylor, 55, p. 250.) 


In a closed-loop system, the results of the operator's actions are fed back into 
the displays, and such feedback produces constant readjustments in the oper- 
ator’s responses. The previously discussed tracking tasks are examples of 
closed-loop man-machine systems. A familiar closed-loop system not involving 
a human operator is a heating system controlled by a thermostat. When the 
room temperature drops below the setting, the thermostat turns on the burner. 
When this action causes the room temperature to rise to the designated level, 
the thermostat turns off the burner. In engineering parlance, such a 
closed-loop system is known as a “servomechanism.” 
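
The thermostat example lends itself to a minimal simulation of a closed loop. The function name, the starting temperature, and the amounts of heat gained and lost per time step are hypothetical values chosen only to make the feedback visible; they describe no actual heating system.

```python
def simulate_thermostat(setting, start_temp, steps, heat_gain=1.0, heat_loss=0.5):
    """Simulate an on-off heating system controlled by a thermostat.

    heat_gain and heat_loss are assumed changes in temperature per time step
    with the burner on and off, respectively.
    """
    temp = start_temp
    history = []
    for _ in range(steps):
        # Feedback: the temperature produced by earlier burner action
        # determines whether the burner runs during this step.
        burner_on = temp < setting
        temp += heat_gain if burner_on else -heat_loss
        history.append((temp, burner_on))
    return history


for temp, burner_on in simulate_thermostat(setting=20.0, start_temp=18.0, steps=8):
    print(temp, "burner on" if burner_on else "burner off")
```

After the room first reaches the setting, the temperature in this sketch oscillates within a narrow band around it, because each change in temperature is fed back to switch the burner.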

Apart from facilitating interdisciplinary communication, the concept of 
man-machine systems has served to advance our understanding of human 
performance. This approach makes it evident that what has often been regarded as 
the performance of the man is actually total system performance. Changing 
the conditions in a tracking experiment, for example, may greatly improve the 
output of the man-machine system. But it cannot be assumed that the per- 
formance of the human operator has improved under these circumstances (55, 
58). Improvement in system output resulting from changes in task variables 
should be distinguished from improvement in operator performance, such as 
that resulting from selection or training. The effects of stress on performance 
need to be similarly analyzed. In one experiment, the deterioration of per- 
formance with stress was found to be much more closely related to task vari- 
ables than to operator variables (23). Thus performance of the intrinsically 
more difficult tasks deteriorated more under stress than did performance of 
the simpler tasks, regardless of the proficiency or training level of the oper- 
ator. 

By investigating the performance of the whole man-machine system, it is 
possible to specify the proportion of total system error attributable to each of 
its parts. As a result, effort can then be focused on improving those parts of 
the total operation that account for a major proportion of performance errors. 
Such an analysis may indicate that changes could most profitably be introduced 
in display characteristics, in control characteristics, or in the intervening data 
processing that the operator must perform. By altering the system dynamics, 
some of the data-processing load may be removed from the operator and built 
into the machine. The operator's task is thereby simplified, and the speed and 
accuracy of overall system performance may be increased to unprecedented 
levels. 

Another way in which system performance can be improved is by the in- 

troduction of a “quickening” device (56). Because of the speed at which jet 
planes travel, for instance, the pilot must make his next control movement 
long before the plane has finished responding to the preceding adjustment. 
With a quickening device in the system, the pilot will be able to read off
from a predictor instrument where the plane will be in a few seconds if the
controls remain steady. This makes the pilot’s task less taxing and more man- 
ageable. 

Some investigators have approached the problem from the opposite angle,
inquiring into the effects of delayed sensory feedback (51, 52). For example, 
subjects carrying out such visual-motor tasks as writing, drawing, or maze 
tracing were not allowed to observe their performance directly but could 
follow it on a closed-circuit television screen. Introducing a delay of about 
half a second in such visual feedback caused a marked deterioration in both 
quality and speed of performance. This problem is of some relevance to space 
science, since earth-based guidance of lunar and planetary vehicles may de-
pend upon the delayed feedback provided by televised images. Ongoing re-
search is concerned with the effect of longer feedback delays and other con-
ditions upon the operation of simulated remote-control systems (51).

The interchange of functions between man and machine is facilitated by the 
use of engineering models to describe human behavior. If the machine is to 
take over some of the functions of the human, these functions must first be 
specified in the terminology of physics and mathematics. Analyzing human 
behavior in terms of a new set of constructs and models should in turn pro-
vide psychologists with fresh insights, sharpen their concepts, and open up
new fields of research (5, 55). The rapidly growing body of psychological 
research on information theory and on decision making illustrates some of the 
effects of engineering psychology upon basic psychological research (see 43, 
ch. 16). 

Link Analysis. A technique that has proved useful in systems development 
is that of link analysis. By this method, data are gathered regarding operational 
connections between different units within a system. The units thus investi- 
gated may be individual dials or other signals on a display panel, individual 
controls on a control panel, different machines operated in succession by the 
same man or by different men, and individual operators working together 
within a system. A strong link between two controls, for example, would 
mean that they are often used in sequence and should therefore be placed 
close together. A strong link between two operators would mean that they 
must frequently communicate, hand each other materials, or interact in other 
ways. 

The purpose of link analysis is to utilize the empirically observed connec- 
tions among units in order to improve the arrangement of displays, controls, 
machines, and men. In this respect, it may be regarded as an extension of 
the problem of work layout considered in an earlier section of this chapter. 
Insofar as link analysis is concerned with connections among several workers, 
it also bears some relation to communication nets and to organization theory, 
discussed in Chapter 6. 

There are basically two methods for obtaining link data: through observa- 
tion of the system in operation and through the judgments of operators. Ob- 
servational techniques are more suitable for determining frequency of link 
usage, while judgment techniques must usually be employed to estimate the 
relative importance of links. When the use of observational methods is not 
practicable, judgments must be relied upon for frequency data as well. 

Direct or unaided observation may be adequate for a simple system con- 
sisting of few units. Motion pictures help in recording the manipulation of 
controls or the movement of men from one work station to another. For link 
analysis of visual displays, an eye camera may be employed. The eye camera 
photographs reflections of light from the eye. These reflections show the se- 
quence of fixation points and the duration of each fixation. This method was 
used in an intensive link analysis of aircraft display instruments (37, 46). 
The percentage of times eye movements occurred from each display to every
other was determined during different flight maneuvers, such as climbing, 
banking, and landing. In the same way, information was obtained regarding 
length of fixation, number of fixations per minute, and proportion of time 
spent fixating on each instrument. On the basis of all this information, the best 
arrangement of the six key indicators was identified, and this arrangement has 
now become standard in all military and commercial planes in the United 
States (see 43, p. 373). 
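
The tabulation of eye-movement links described above can be sketched in a few lines of Python. This is only an illustration under stated assumptions: the instrument names and the fixation sequence are hypothetical, and the function name `transition_percentages` is invented here; the cited studies do not describe their computing procedure.

```python
from collections import Counter


def transition_percentages(fixations):
    """Return {(from_display, to_display): percent of all eye movements}.

    `fixations` is the ordered list of displays fixated, as an eye-camera
    record might yield. Consecutive fixations on the same display are not
    counted as movements between displays.
    """
    moves = [(a, b) for a, b in zip(fixations, fixations[1:]) if a != b]
    if not moves:
        return {}
    counts = Counter(moves)
    total = len(moves)
    return {pair: 100.0 * n / total for pair, n in counts.items()}


# Hypothetical fixation record for one maneuver (not data from the studies).
sequence = ["airspeed", "horizon", "altimeter", "horizon", "airspeed", "horizon"]
links = transition_percentages(sequence)
for (src, dst), pct in sorted(links.items()):
    print(f"{src} -> {dst}: {pct:.0f} per cent")
```

Pairs with high percentages are the strong links; in a layout study these are the displays one would try to place close together.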

When information is obtained on both frequency and importance of differ- 
ent links, it is customary to combine the two sets of link values into a com- 
posite index. For this purpose, frequency and importance must first be ex- 
pressed in the same units. A three-point scale is commonly used, in which 1 
represents the least and 3 the most frequency or importance. The composite 
link value can then be found by multiplying frequency and importance values
for each link. These composite values can be presented in a schematic diagram,
as illustrated in Figure 70.
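
The composite index just described can be illustrated with a short Python sketch. The link names and ratings below are hypothetical, and the function name `composite_link_values` is invented for this example; only the rule itself (frequency times importance, each on a three-point scale) comes from the text.

```python
def composite_link_values(frequency, importance):
    """Multiply frequency and importance ratings (each 1 to 3) for every link."""
    composite = {}
    for link, freq in frequency.items():
        imp = importance[link]
        for rating in (freq, imp):
            if rating not in (1, 2, 3):
                raise ValueError(f"rating {rating!r} for {link} is not on the 1-3 scale")
        composite[link] = freq * imp  # ranges from 1 to 9
    return composite


# Hypothetical ratings for three links between men and machines.
frequency = {("man 1", "machine A"): 3, ("man 1", "man 2"): 2, ("man 2", "machine C"): 1}
importance = {("man 1", "machine A"): 3, ("man 1", "man 2"): 1, ("man 2", "machine C"): 2}

values = composite_link_values(frequency, importance)
for link, value in values.items():
    print(link, value)
```

Because the two ratings are multiplied, a link reaches the top value of 9 only when it is both used most often and judged most important.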

The hypothetical system portrayed in Figure 70 includes three men and five
machines. Three types of link are shown, namely, control links, visual com-
munication links, and auditory communication links. Composite link values,
which may range from 1 to 9, are given for each link. The upper part of
Figure 70 contains the original arrangement of men and machines; the lower
part contains the revised arrangement. The major objective of such rearrange- 
ment is to minimize the distance between units that have high link values. In 
general, the distance between units should be inversely related to their link 
values. A second objective is to minimize the number of crossovers of link 
connections. 


Fig. 70 Hypothetical example of rearrangement of men and machines
as a result of link analysis. Link values are given in small circles on
link connections. (From McCormick, 43, p. 411.)

[Figure legend: men (1, 2, 3); machines (A, B, C, D, E); control links;
visual links; auditory links. Panel (a): original arrangement.]

These objectives can be approximated by inspection of the diagrams and by
trial-and-error procedures. Mathematical techniques are also available to deter- 
mine the arrangement that minimizes the link-by-distance relationships among 
units, although their application to large systems is very laborious. Any given 
layout, moreover, must usually be evaluated in terms of several criteria, such 
as visibility of visual displays, intelligibility of spoken communications (as a 
function of distance, cross talking, and other noise), amount of walking re- 
quired, crowding and extent to which adjacent workers may interfere with 
each other's work, and other considerations relevant to the specific layout. 
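
One way to picture the mathematical approach mentioned above is an exhaustive search that tries every assignment of units to work positions and keeps the one with the smallest sum of link value times distance. The sketch below is an assumption-laden illustration, not the technique used in the cited sources: the units, positions, and link values are hypothetical, straight-line distance is assumed, and the function names are invented here.

```python
from itertools import permutations


def layout_cost(assignment, positions, links):
    """Sum of (link value x straight-line distance) over all linked pairs."""
    total = 0.0
    for (u, v), value in links.items():
        (x1, y1), (x2, y2) = positions[assignment[u]], positions[assignment[v]]
        total += value * ((x1 - x2) ** 2 + (y1 - y2) ** 2) ** 0.5
    return total


def best_layout(units, positions, links):
    """Try every assignment of units to positions; return (cost, assignment)."""
    best = None
    for perm in permutations(range(len(positions)), len(units)):
        assignment = dict(zip(units, perm))
        cost = layout_cost(assignment, positions, links)
        if best is None or cost < best[0]:
            best = (cost, assignment)
    return best


# Hypothetical example: three units, three positions in a row.
units = ["A", "B", "C"]
positions = [(0, 0), (1, 0), (2, 0)]
links = {("A", "C"): 9, ("A", "B"): 1}

cost, assignment = best_layout(units, positions, links)
print(cost, assignment)
```

The number of assignments grows factorially with the number of units, which is consistent with the remark that such methods become very laborious for large systems. A real evaluation would also have to weigh the other criteria listed above, which this sketch ignores.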

System Simulation. Another technique utilized by engineering psycholo- 
gists is system simulation, which permits the investigation of man-machine 
systems under laboratory conditions (see 19, ch. 13). An example is provided 
by a long-range project conducted at the Laboratory of Aviation Psychology 
of Ohio State University (14). This investigation was concerned with air- 
traffic control systems, whose function is the safe and efficient control of air- 
craft within a terminal area of about 50 miles around the airport. By means of 
a specially designed electronic analog simulator, it was possible to present 30 
independently controlled “aircraft,” or target blips, on simulated radar scopes. 
A “pilot room” and a “radar control room” were set up in the laboratory 
(Figure 71). Both provided realistic representations of actual field situations. 
The subjects consisted of trained air-traffic controllers and pilots. 

Several criteria of system performance were employed to study the effects 
of different task and procedural variables. Among these criteria were average 
fuel consumption, average amount of time required to bring an aircraft in or 
to take it out to a point 50 miles from takeoff, amount of deviation of incoming
aircraft from the heading of the runway, and number of conflicts (defined as 
failure to maintain minimum separation between aircraft). 

Among the many conditions investigated were various display characteristics;
load variables, as illustrated by the rate at which aircraft enter the terminal
area; and procedural variables, including communication procedures and types 
of instructions that controllers may issue to pilots. Organizational variables 
were also investigated. For example, how does the performance of two con- 
trollers working in a face-to-face situation compare with that of two controllers 
assigned to separate work stations? When two controllers work together, what 
are the effects of dividing responsibility between them on the basis of distance 
(farther aircraft controlled by one, nearer aircraft by the other) or on the 
basis of sectors (aircraft entering northern sector controlled by one, those 
entering southern sector by the other)? 

It is anticipated that subsequent research conducted in this project will be 
concerned increasingly with problems of small-group organization and profi- 
ciency. By investigating small-group behavior under such carefully controlled 
laboratory conditions, this type of research cuts across the traditional areas of 
social and experimental psychology. Its findings are likely to extend far beyond 
the immediate practical problems under investigation and may ultimately con- 
tribute to basic psychological research. 

Simulators have been used widely in the development of man-machine 
systems for space flight. Psychological research has been conducted with sub- 
jects confined for extended periods in sealed cabins that reproduce most of the 
living conditions of space cabins and require the performance of realistic 
control tasks (see 32). Another promising application of system simulation is
to be found in the development of remote-control systems for earth orbital
satellites and for lunar or planetary vehicles (10, 51, 52). By means of simu-
lators, it is possible to compare the efficiency of alternative control procedures
and to investigate the effect of various conditions upon the performance of the
total man-machine system.¹

¹ Besides its use in the development and improvement of man-machine systems, simula-
tion has been widely employed for training purposes, as illustrated in Chapter 5.

Fig. 71 Simulated pilot room and radar control room used in research
on air-traffic control systems. (Courtesy of G. E. Briggs, Aviation
Psychology Laboratory, Ohio State University.)


SUMMARY


Although dating largely from World War II, modern research on human
factors in equipment design was foreshadowed by the time-and-motion study
of Taylor and Gilbreth, by certain investigations of the Great Britain In-
dustrial Fatigue Research Board, and by other early research. The immediate
stimulus for the more recent involvement of psychologists in equipment design 
came from the realization that the complexity of machines was beginning to 
exceed man's capacity to operate them. Hence it became necessary to redesign 
the equipment so as to make the operator's job less demanding. In contrast 
to earlier psychological contributions to equipment design, modern engineer- 
ing psychology is more extensive, systematic, research-oriented, and compre- 
hensive in its approach. 

One of the problems of equipment design is that of the spatial layout of 
work. This is illustrated by research on the pre-positioning of tools, the semi- 
circular work space, and the proposed redesign of the typewriter keyboard. An- 
other problem pertains to the improvement of displays through which informa- 
tion is transmitted to the operator. Although visual displays are the most com- 
mon, auditory displays are particularly useful in certain situations. Some re- 
search has also been done on the potentialities of tactual displays. With regard 
to visual displays, extensive research has been conducted on the factors influ- 
encing speed and accuracy of dial reading, such as dial size and shape, length 
of scale units, number and spacing of scale markers, direction of pointer move-
ments, size and style of letters and numerals, and the like. There is evidence
of interaction between such display variables and procedural variables, such 
as type of reading required. 

Research on control problems can be illustrated by the hundreds of experi- 
ments on tracking, most of which have investigated the effects of task variables 
upon tracking performance. Studies of display-control compatibility have been 
concerned with isomorphism of display and control patterns as well as with 
population stereotypes. A final example is provided by research on the coding 
of controls in terms of position, size, shape, or color. Confusion among controls 
can be reduced by standardizing similar controls, increasing the discrimin- 
ability of different controls, and utilizing meaningful associations for the 
identification of controls. 

The broadening scope of engineering psychology is best typified by its grow- 
ing concern with man-machine systems. By describing the functions of the 
human operator in engineering terminology, such an approach makes it pos- 
sible to transfer some of these functions from the man to the machine. The 
systems approach also permits the efficient arrangement of men and machines 
within a complex system. Two techniques that illustrate the procedures em- 
ployed in system development are link analysis and system simulation. It is 
through the study of man-machine systems that the greatest improvements in 
speed and accuracy of operation have been achieved. It is through this ap- 
proach, too, that engineering psychology is most likely to contribute to the 
advancement of basic psychological knowledge. 

Scope and Methods
of Consumer Psychology 


While personnel and engineering psychology are concerned primarily with 
the behavior of the individual as a producer, consumer psychology deals with 
his behavior as a consumer. This field of psychology began with the psychol- 
ogy of advertising and selling, whose object was effective communication from 
the manufacturer or distributor to the consumer. Through advertising, the 
consumer is informed about available products or services and about the par- 
ticular ways in which they may meet his needs. Advertising itself dates from 
the earliest recorded periods of human history (see 26, ch. 1; 84; 90). The
excavations of Pompeii, for example, disclosed advertisements for a variety 
of products or services, painted on the walls of public buildings. Newspaper 
advertising appeared shortly after the establishment of the first newspapers in 
the seventeenth century. Advertising psychology (31, 108) was launched dur- 
ing the first two decades of the present century with scattered laboratory 
studies and with the publication of books by such pioneers as Walter Dill 
Scott (93) and H. L. Hollingworth (40). 

More recently, consumer psychology has been broadened to cover two-way 
communication, from consumer to producer as well as from producer to con-
sumer. Communications from the consumer were first sought in the 1920s as a
means of preparing more effective advertisements (58). Systematic inquiry into 
consumer needs and wants enabled the advertiser to identify those features of 
his product that would be of greatest interest to the consumer. Such consumer 
surveys led directly to the next step, which involved the consideration of con- 
sumer wants and preferences in the actual designing of products. The psychol- 
ogy of advertising has now been incorporated into the broader field of con- 
sumer psychology. One indication of the increasing vigor of this field of research 
is the formation in 1960 of a division of consumer psychology in the American 
Psychological Association and the establishment, that same year, of the Journal 
of Advertising Research. 

The methods of consumer psychology are applicable to a number of fields 
besides that of developing and marketing industrial goods and services. One 
such application is to be found in public opinion polls, whose procedures are 
essentially the same as those of consumer surveys. Another example is provided 
by the well-designed research on food preferences conducted by the armed 
services as an aid in the planning of meals for military personnel. As a final 
illustration may be mentioned the use of advertising campaigns in connection 
with problems of general public interest. These campaigns have become a 
familiar part of the American scene. Their objectives range from the preven- 
tion of forest fires to the promotion of mental health. A specific example is 
provided by the advertising campaigns and the attitude surveys conducted by
the U.S. Public Health Service in the effort to encourage widespread use of 
polio vaccine (45, 89). Whether such campaigns are undertaken by a govern- 
ment agency, a voluntary organization, an industrial corporation, or The Ad- 
vertising Council, their success depends upon the same principles as all forms 
of advertising. In all of them, the communication must catch and hold the 
attention; it must be easily understood; and it must induce the appropriate 
action. 

Modern consumer psychology covers a wide variety of problems, and its 
methods are correspondingly diverse. In the present chapter we shall examine 
some of the typical research procedures followed in five major areas: testing 
the effectiveness of advertisements, radio and television research, personal sales- 
manship, consumer preferences and product design, and economic psychology. 


TESTING THE EFFECTIVENESS OF 
PRINTED ADVERTISEMENTS 


Psychological studies of advertising vary in scope from the evaluation of a 
whole sales campaign, through a comparison of the relative merits of two al- 
ternative ads for the same product, to studies of specific features of ads, such as 
the legibility of printed type or the feelings aroused by different colors. Since 
the techniques employed in radio and television research will be discussed later 
in the chapter, this section will be concerned only with printed or visually 
presented advertisements, as illustrated by magazine, newspaper, direct-mail, 
car-card, and billboard advertising. For convenience, the procedures will be 
grouped under the following categories: consumer jury technique, readership 
surveys, controlled exposure studies, brand use surveys, analysis of coupon re-
turns, and sales tests.²

Consumer Jury Technique. Although essentially an opinion test, the con- 
sumer jury technique is based on the opinions of a consumer sample, rather 
than on the opinions of advertisers. Because of the advertising specialist’s con- 
tinuing professional involvement with ads, his reactions to an ad are likely to 
be quite unlike those of the general public. Some large advertising agencies 
maintain a consumer panel so chosen as to be a representative sample of the 
consumer population. In terms of available data regarding age, sex, socio-
economic level, education, and other relevant variables, individuals may also
be selected from such a panel so as to constitute appropriate samples for specific 
products, such as baby foods or Cadillacs. 

Copies of ads may be submitted to consumer juries by mail or by personal 
interview, with such questions as the following: "Which of these two ads 
would interest you more?" "Which would be more likely to induce you to 
buy?" "Which of these two headlines would catch your attention more
readily?" Despite their subjectivity, consumer jury tests often yield results that
agree closely with those of more objective tests based on actual returns (16;
58, pp. 119-122). To ensure that panel members respond to the ads as typical
consumers, however, the same persons should not be employed too often in
judging ads. It is also desirable to have individuals examine only ads of those
products for which they are themselves potential consumers. If a bachelor is
asked to evaluate ads for baby foods or washing machines, he is likely to
assume the role of an advertising critic rather than that of a consumer.

¹ Much consumer research is published in sources not generally familiar to psychology
students. A helpful guide to this literature, which lists bibliographies, abstracting
journals, directories, and other relevant sources, can be obtained from the Advertising
Research Foundation, New York (95).

² More detailed discussion of techniques for measuring advertising effectiveness can be
found in Lucas and Britt (63).

Readership Surveys. Specialized survey organizations, such as those 
founded by Daniel Starch and by George Gallup, conduct continuing reader- 
ship surveys for a number of magazines and newspapers. In addition to esti- 
mates of the size and composition of the total audience reached by each of 
these media, the surveys provide comparative data on the percentage of persons 
who remember each of the ads appearing in them. Both recall and recognition 
methods may be used in such surveys. Aided recall is illustrated by the “im- 
pact” technique used by Gallup & Robinson, Inc. Every respondent who has 
read a given issue of the magazine or newspaper is given a set of brand or 
company names and is asked whether he remembers seeing an ad for any of 
them in that issue. For each ad he claims to have seen, the respondent is re- 
quired to “play back” its selling message from memory. The playback is simply 
a report, in the respondent's own words, of all he can remember about the ad. 
To elicit the full report, the interviewer asks a series of probing questions per- 
taining to what the ad looked like, what it said, and so forth (see 37, pp. 176- 
177). 

The surveys conducted by Daniel Starch and Staff, on the other hand, 
typically rely on the recognition of ads. Every respondent who has seen the 
issue under investigation is shown each ad in turn and is asked three questions 
about it: whether he remembers seeing the ad, whether he noticed the name 
of the product or advertiser, and whether he read 50 per cent or more of the
written matter in the ad (see 37, pp. 172-176). From the replies to these 
questions, three corresponding Starch scores are computed, known as the 
"noted," "seen-associated," and "read most" scores, respectively. Beginning in 
1955, Starch added "Reader Impression Studies" to his survey services (100).
These studies deal with the attitudes aroused by the ad as a whole, the product,
the illustration, and the written matter. Through a series of probing questions, 
the interviewer explores what the ad means to the respondent, whether any 
part of it arouses resistance, the extent to which claims are accepted, and the 
like. 

In recognition tests, some persons will usually report that they saw an ad 
even if it has not yet appeared in any medium. These false recognitions may 
result in part from confusion with other similar ads, especially in the case of a 
widely advertised commodity. Some respondents, too, may deliberately falsify 
their responses so as to impress the interviewer with their powers of observation 
or for the many other reasons that prevent frankness in a face-to-face interview. 
In order to allow for false recognitions, some ads that have not yet appeared 
may be interspersed with the regular ads. The percentage of persons who 
falsely recognize the control ads can then be used as a correction term in ad- 
justing the percentages obtained with the published ads. Since ads vary widely 
in the frequency of false recognition, however, the ideal procedure is to de- 
termine this frequency for each ad prior to its publication and to use these 
figures to correct the postpublication results for each ad separately (61, 64).
There is some evidence that the inflation of readership scores is higher for ads 
of well-known brands than for ads of less familiar brands (55). 
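
The per-ad correction described above can be sketched as follows. The text says only that the false-recognition percentage serves as a correction term; the simple subtraction used here is an assumption made for illustration, as are the ad names, the figures, and the function name `corrected_readership`.

```python
def corrected_readership(post_pct, pre_pct):
    """Subtract each ad's prepublication (false) recognition percentage
    from its postpublication percentage, never going below zero.

    Subtraction is an assumed form of the correction, not one given in the text.
    """
    corrected = {}
    for ad, observed in post_pct.items():
        false_rate = pre_pct[ad]  # measured separately for each ad
        corrected[ad] = max(0.0, observed - false_rate)
    return corrected


# Hypothetical percentages of respondents claiming to recognize each ad.
post_publication = {"soap ad": 42.0, "coffee ad": 30.0}
pre_publication = {"soap ad": 12.0, "coffee ad": 3.0}

adjusted = corrected_readership(post_publication, pre_publication)
print(adjusted)
```

In this made-up case the soap ad leads by 12 points before correction but by only 3 points after it, which shows why a single correction term applied to all ads could mislead when false recognition varies from ad to ad.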

A well-designed comparative study of the Starch and the Gallup-Robinson 
survey techniques was conducted by the Advertising Research Foundation’s 
Committee on Printed Advertising Rating Methods, commonly known as the 
PARM study (1, 62). A single issue of Life magazine was investigated by 
interviewers who followed as closely as possible the procedures of the Starch 
and Gallup-Robinson readership surveys, respectively. Readership data ob- 
tained in the usual way by these two commercial survey organizations were 
also available for this issue of the magazine. The sample contacted in the 
PARM study was larger and more nearly representative than those ordinarily 
employed in readership surveys, each group including over 13,000 households 
and yielding about 600 respondents who had read that issue of Life. 

The PARM “noted” scores obtained for each ad agreed quite closely with 
those reported by Starch for the same ads. The correlations between the two 
sets of scores were .86 for men and .92 for women respondents. For the Gallup-
Robinson aided-recall scores, agreement was not so close, as indicated by cor-
relations of .61 and .82 for men and women. It is likely that the more subjec-
tive and unstandardized nature of the aided-recall procedure accounts in part
for these differences. The PARM study showed, for instance, that rated
competence of interviewers was related to aided-recall scores but not to noted
scores. The investigation also yielded a wealth of data on many specific meth-
odological questions. For example, by rotating the order of presentation of
ads, the PARM study established that readership scores tend to be higher for 
ads presented early in the interview and lower for those presented later. This 
finding led to a revision of the Starch survey procedure whereby each inter- 
view now begins on a randomly selected page, rather than proceeding uni- 
formly from front to back of the magazine as heretofore. 
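
Agreement between two sets of readership scores, such as the correlations reported above, is conventionally expressed by the product-moment (Pearson) correlation coefficient. The sketch below shows the computation on hypothetical scores; the text does not name the coefficient used in the PARM study, so treating it as Pearson's r is an assumption, and the numbers are invented.

```python
from math import sqrt


def pearson_r(x, y):
    """Product-moment correlation between two equally long score lists."""
    n = len(x)
    if n != len(y) or n < 2:
        raise ValueError("need two lists of equal length with at least 2 scores")
    mean_x, mean_y = sum(x) / n, sum(y) / n
    sxy = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    sxx = sum((a - mean_x) ** 2 for a in x)
    syy = sum((b - mean_y) ** 2 for b in y)
    return sxy / sqrt(sxx * syy)


# Hypothetical "noted" percentages for the same four ads from two surveys.
survey_one = [10, 20, 30, 40]
survey_two = [12, 22, 29, 45]

r = pearson_r(survey_one, survey_two)
print(round(r, 2))
```

A coefficient near 1 means the two surveys rank and space the ads in nearly the same way, even if one survey yields uniformly higher percentages than the other.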

In its efforts to devise improved methods for readership surveys, the PARM 
committee developed an instrument called the communiscope (29). By means 
of a portable slide projector and a light-timing device, each ad can be exposed 
for a fraction of a second (usually 0.6 second). This technique represents an 
adaptation of the tachistoscope, an instrument used widely in the psychological 
laboratory for the rapid exposure of visual materials. Immediately after each 
exposure, the respondent is asked whether he remembers seeing the ad. If he 
does, he is further asked to name the brand advertised and to describe what 
he remembers about the ad. This “playback” of the advertising message is 
picked up by a portable tape recorder for later analysis. 

Since the communiscope permits a momentary glimpse of the whole ad, it 
may avoid both the “overstimulation” of the Starch recognition method and
the “understimulation” of the Gallup-Robinson aided-recall method (29). In
the former, the respondent probably recalls too much, since the ad is before
him all along; in the latter, he probably recalls too little, since it is difficult to
remember a specific ad when only the name of the company is given. 

The communiscope is a promising technique which permits more objectivity 
and better control of conditions than do traditional readership survey pro-
cedures. A well-designed field test of its effectiveness yielded encouraging re-
sults (72). Thirty ads representing over 10 product categories were presented
to two matched samples of 216 respondents each. Each respondent who
claimed to have seen the ad was questioned about brand name and other
aspects of the ad. The same procedure was followed with a representative 
group of the respondents who said they had not seen the ad before. In this 
way it was possible to determine how much the respondent could learn about 
each ad during the tachistoscopic exposure itself. Only those respondents who 
recalled more about an ad than this control amount were recorded under
"proved recognition." The proved-recognized scores for the 30 ads ranged from
2 per cent to 32 per cent, suggesting that the technique is capable of discrimi- 
nating widely among different ads. Its reliability was indicated by the very 
close agreement obtained in the two samples of 216 cases. The largest differ- 
ence between the proved-recognized scores found in the two samples for any 
ad was less than 1 per cent. 

Controlled Exposure Studies. Certain research procedures provide meas- 
ures of the subject's responses to ads that are initially presented under con- 
trolled conditions. This approach differs from that of readership surveys, 
which measure the effects of uncontrolled everyday-life exposure to printed 
ads. For example, a specially prepared dummy magazine may be distributed 
to readers and then followed up with recall or recognition tests. The dummy 
magazine is so constructed as to hold constant all variables except those under 
investigation. Thus, if we wanted to test the effect of size of ads upon their 
attention and memory value, we would insert ads of different sizes in which 
such other characteristics as color, illustration, and position were equated. 
Dummy magazines may also be used in situations where the subject is ob- 
served—possibly through a one-way vision screen—while he thumbs through 
the magazine. The experimenter can thus determine directly which ads the 
subject looks at. 

Similarly, ads may be exposed in a store window in accordance with some 
predetermined plan, while a concealed observer records the proportion of 
persons passing by who stop to look at each ad. Or subjects may be driven past 
a series of billboards and later tested for recall of their content. A number of 
laboratory studies have been concerned with subjects' reactions to specific 
features of ads, such as color, printing type, or illustration. 

Observations of subjects’ behavior toward ads can sometimes be made more 
objective and precise by the use of certain well-known laboratory instruments. 
Eye cameras have been employed to photograph the subject’s eye movements
while he is looking at ads. This method provides a continuous record of all
eye fixations. With it we can determine the order in which different ads are 
examined, the amount of time spent looking at each, and the path followed 
by the eye as it explores the different parts of a single ad. Portable models of 
the eye camera have been developed for use in advertising field studies (35, 
pp. 282-285; 50; 68; 69). One such model is shown in Figure 72. It enables 
the subject to turn pages as he chooses and move his head freely. The maga- 
zine is read through what appears to the subject as a transparent glass but is 
in fact a partially silvered mirror that reflects an image of the face onto a 
motion-picture camera behind the subject.

Another useful instrument is the previously mentioned tachistoscope, which 
permits the rapid exposure of visual stimuli. Such an instrument could be used, 
for example, to determine the length of headline that can be correctly read at 
a glance. Adapted for field use, it can also be employed in testing ads prior to 
publication. Experimental ads can thus be compared as to the amount and
clarity of information they convey during a quick, cursory inspection. Still
another exposure device developed for use in advertising research is the Visual
Testing Apparatus (VISTA) described and illustrated in Chapter 12. By
means of this instrument, it is possible to control not only duration of exposure
but also distance, illumination, and other stimulus variables.

Fig. 72 Purdue Eye Camera in use. (Courtesy of James S. Karslake.)

Some use has also been made of the galvanic skin reaction to investigate the 
strength of emotional responses aroused by different ads. This familiar labora- 
tory technique uses a galvanometer to detect the slight increases in secretion 
of the sweat glands that occur during emotional excitement (see 35, pp. 284- 
286). 

Brand Use Surveys. The methods discussed so far have been concerned 
with reactions to the ads as such. Does the ad attract and hold attention? Does 
it make a lasting impression, so that it can be recalled or recognized later on? 
It is obvious that these are necessary but not sufficient conditions for effective 
advertising. The ultimate question is whether the ad succeeds in leading to 
appropriate action. Although more difficult to measure under controlled 
conditions than the attention or memory value of ads, buying behavior provides 
the most comprehensive index of advertising effectiveness. 

Brand use surveys represent one technique for the investigation of buying 
responses. The survey may be conducted by mail, telephone, or house-to-house 
interviews. The respondent is merely asked what brand of a certain commodity, 
such as coffee or soap, is currently being used in the household. A single 
survey can conveniently cover a number of common household commodities. 
The frequency of brand use can be tied in with advertising by including 
questions about the magazines and newspapers that are read regularly in the 
home. Thus if brand A is advertised in magazine X, we can compare the 
frequency of use of brand A among readers and nonreaders of magazine X. 
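The comparison just described can be sketched in a few lines of Python. This is only an illustration under stated assumptions: the record layout (one dictionary per household with a `brand` and a set of `magazines`) and the six replies are hypothetical and do not come from any survey cited here.

```python
from collections import Counter

def brand_use_by_readership(responses, brand, magazine):
    """Return (rate among readers, rate among nonreaders) of `magazine`.

    Each response is a dict with a "brand" (brand currently used) and a
    "magazines" set (magazines read regularly). This layout is hypothetical.
    """
    users = Counter()
    totals = Counter()
    for r in responses:
        group = "reader" if magazine in r["magazines"] else "nonreader"
        totals[group] += 1
        if r["brand"] == brand:
            users[group] += 1

    def rate(group):
        # None when the group is empty, so no rate can be computed.
        return users[group] / totals[group] if totals[group] else None

    return rate("reader"), rate("nonreader")

# Hypothetical replies from six households.
responses = [
    {"brand": "A", "magazines": {"X"}},
    {"brand": "A", "magazines": {"X", "Y"}},
    {"brand": "B", "magazines": {"X"}},
    {"brand": "A", "magazines": {"Y"}},
    {"brand": "B", "magazines": set()},
    {"brand": "C", "magazines": {"Y"}},
]

readers, nonreaders = brand_use_by_readership(responses, "A", "X")
print(f"brand A use: readers {readers:.2f}, nonreaders {nonreaders:.2f}")
```

A higher rate among readers is, of course, only an association; it does not by itself show that the advertising produced the brand use.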

Several procedures may be followed to check on the accuracy of replies in 
brand use surveys. For example, the interviewer may arrange to return and 
collect empty containers, for which the respondent receives some premium. Or 
the interviewer may offer to buy old magazines for a nominal sum. A “pantry 
check” of the brands actually on the shelves at the time of the interview is 
sometimes feasible. 

Starch (99) reports a 16-year study in which some 400,000 interviews were 
conducted to determine the relation between ad readership and reported prod- 
uct purchases. The study covered a total of about 45,000 ads appearing in the 
Saturday Evening Post or Life. Ad readership was measured by the seen- 
associated score obtained by the usual Starch procedure. In addition, respond- 
ents were questioned about purchases they had made during the first week 
after the publication of each ad. Results were analyzed so as to show net-ad- 
produced purchases (called “Netapps”). This index was derived from the 
proportion of ad readers who bought the product and the proportion of 
nonreaders of the ad who bought it. On this basis, Starch concluded that, on the 
average, the net effect of these magazine ads in increasing the number of 
buyers was 13.04 per cent. A limitation of this type of survey stems from the 
difficulty of analyzing cause-effect sequences. Did a respondent begin using 
the product because he saw the ad, or did he notice the ad because he was 
already using that brand? 

Some attempt to disentangle these relations was made in special parts of the 
Starch study. For instance, 898 ad readers were paired with nonreaders of the 
same ads who had the same buying rate for the product before the ad appeared. 
Following the publication of the ads, those who had read the ads made 14.5 
per cent more purchases than the nonreaders of the ads. Further supporting 
data were obtained by noting the increases in purchases by issue readers in 
the case of products that had stopped advertising in that magazine. 

Coupon Returns. The analysis of coupon returns provides another opportunity 
to compare the effectiveness of ads appearing in magazines or newspapers. 
By keying the coupons in some convenient way, those belonging to 
each ad can be identified. Actual buying behavior can be studied if the coupon 
is used for mail-order purchases. When the coupon is inserted merely to test 
the attention value of different ads, it may offer a free sample, a recipe booklet, 
an informative brochure about the product, or some other inducement. Under 
these circumstances, however, we cannot assume that coupon returns indicate 
potential sales. Coupons are often filled out by children or by habitual clippers, 
who may clip the coupon just because they like to receive mail or because 
they cannot resist a free offer of any article. Thus, unless actual mail-order sales 
are involved, coupon returns indicate only the relative attention value of the 
ads themselves. 

In any event, it is essential that both the nature of the offer and the physical 
characteristics of the coupon be held constant if the difference in returns is to 
reflect the relative effectiveness of the ads. The position of the coupon, for 
example, will greatly affect frequency of returns. Coupons in the outside 
bottom corner of the page are more conspicuous and easier to clip than coupons 
in the lower center of the page. And returns are nearly twice as frequent for 
the former as for the latter position (98). 

The ads themselves should also be comparable in all characteristics except 
those under investigation. Ideally the ads should appear on the same page of 
the same issue of the same magazine. If less than full page in size, the ads 
should occupy the same position on the page. This apparently impossible feat 
can be accomplished through the split-run technique. By arrangement with 
the magazine or newspaper, one half of the copies of a given issue carry one 
ad, and the other half carry the other ad. Comparable distribution to all out- 
lets is assured by having the two editions alternate as they come off the press. 
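The alternation that makes a split run comparable can be shown with a small Python sketch. The function names and the bundle size of four are illustrative assumptions; the point is only that when the two editions alternate in press order, any bundle sent to an outlet contains them in equal or nearly equal numbers.

```python
from itertools import cycle

def split_run(n_copies, editions=("A", "B")):
    """Label copies in press order, alternating between the two editions."""
    labels = cycle(editions)
    return [next(labels) for _ in range(n_copies)]

def bundle(copies, bundle_size):
    """Cut the press run into consecutive bundles, as if for delivery to outlets."""
    return [copies[i:i + bundle_size] for i in range(0, len(copies), bundle_size)]

copies = split_run(12)
bundles = bundle(copies, 4)   # hypothetical bundle size
for number, b in enumerate(bundles, start=1):
    print(f"bundle {number}: {b.count('A')} copies of ad A, {b.count('B')} of ad B")
```

With an odd bundle size the two counts would differ by one copy at most, which is still far closer than any separate distribution of the two editions could guarantee.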

Split-run tests may also be applied to ads that do not carry coupons, provided 
their distribution is controlled and the results can be tied in with the form of 
the ad employed. The two ads for Du Pont textile fibers reproduced in Figure 
73 were among those investigated by a technique combining split run with a 
type of readership survey (71). Some time after the ads had been run in a 
technical magazine, a random sample of subscribers who had received each 
form were mailed a reprint of the appropriate ad from which all copy had been 
deleted. This was accompanied by a questionnaire asking the respondent to 
give the magazine in which the ad had appeared, the advertiser, the product, 
and the main idea of the advertising message. The replies to all four questions 
favored ad B, which featured present rather than future uses of the product. 
Such a copy-deleted test could not be used with ads whose illustrations provide 
cues to the product advertised. Moreover, when ads differ in several respects, 
as do those in Figure 73, it is impossible to identify the specific reason for the 
difference in their attention and memory value. 

Fig. 73 Two ads used in a split-run test. (See McNiven, 71; reproduced by 
courtesy of E. I. du Pont de Nemours & Company, Inc.) 

Sales Tests. The most obvious way to find out how well an ad works is 
to see what happens to sales after the ad is run. Although apparently simple, 
such sales tests are expensive and difficult to conduct. A major difficulty is that 
of ruling out the effect of other variables on sales. If there is a sudden spell 
of bad weather with a resulting increase in colds, the sale of cold remedies will 
rise regardless of advertising. In the effort to control these extraneous in- 
fluences, sales tests are conducted simultaneously in several cities. 

First, a number of typical and comparable cities are selected for study. Each 
new sales campaign to be tested is then introduced in a randomly chosen set 
of these cities. Other cities from the original group serve as controls in which 
no advertising is run or the current advertising is continued unchanged. 
Through special arrangement with retailers, a record of sales is kept for a 
designated number of weeks prior to the campaign, while it is in progress, and 
after its completion. Changes in sales in each of the experimental cities can 
then be evaluated in terms of concurrent changes in the control cities. Sales 
tests conducted in actual practice, however, often fall short of ideal 
requirements, especially with regard to comparability of experimental and control 
cities. 
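The logic of evaluating experimental cities against control cities can be made concrete with a short Python sketch. It assumes one simple way of doing the evaluation, namely subtracting the average percentage change in the control cities from the average percentage change in the experimental cities; the source does not specify a formula, and the sales figures are hypothetical.

```python
from statistics import mean

def pct_change(before, during):
    """Percentage change in sales from the pre-campaign period to the campaign period."""
    return 100.0 * (during - before) / before

def net_campaign_effect(experimental, control):
    """Average change in experimental cities minus average change in control cities.

    Each argument is a list of (sales before, sales during) pairs, one per city.
    The subtraction is an assumed, simplified evaluation rule.
    """
    exp_change = mean(pct_change(b, d) for b, d in experimental)
    ctl_change = mean(pct_change(b, d) for b, d in control)
    return exp_change - ctl_change

# Hypothetical weekly sales in two experimental and two control cities.
experimental = [(1000, 1300), (800, 1000)]
control = [(900, 990), (1200, 1320)]

effect = net_campaign_effect(experimental, control)
print(f"net change attributable to the campaign: {effect:.1f} percentage points")
```

In this example sales rose in the control cities as well, so crediting the campaign with the whole rise in the experimental cities would overstate its effect. The subtraction is only as trustworthy as the comparability of the two sets of cities.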


RADIO AND TELEVISION RESEARCH 


Much of the continuing research on radio and television deals with the size 
and composition of the audience reached by different programs. This research 
is similar to the readership surveys conducted with magazines and newspapers. 
The focus, however, is on the entire program rather than on the advertising 
or “commercials,” on which relatively little direct research has been done. A 
second type of investigation is concerned with preferences and audience 
reaction in general. A third type, involving direct measurement of the sales 
effectiveness of radio and television advertising, is so far represented by few studies. 
The difficulty of obtaining sales data under adequately controlled conditions 
is undoubtedly a serious stumbling block in the way of this last type of re- 
search. 

Audience Size and Composition. Many techniques have been devised 
for finding out how many people listen to each program and who they are in 
terms of sex, age, education, socioeconomic level, geographical distribution, and 
other characteristics. The results of such surveys form the basis of the widely 
publicized ratings of programs and stations. No one survey method is com- 
pletely satisfactory. Each has its own advantages and weaknesses, and each 
provides a somewhat different kind of audience information.* 

One simple technique utilizes a hidden offer. At some inconspicuous point 
in the program, a free article is offered. The article chosen is one that is 
generally desirable and need not be related to the products advertised on the 
program. It is assumed that, owing to the "hidden" position of the offer, those 
who write in for the article have actually listened to all or most of the 
program, rather than having heard only the offer. This assumption is probably 
correct in general, although some respondents may have heard about the offer 
from a friend or neighbor. The latter objection, of course, also applies to 
analyses of coupon returns. At best, the hidden-offer technique can provide 
only a rough comparison of the relative audience coverage of two or more 
programs in which similar offers are made. 

* A detailed evaluation of current broadcast rating procedures, with suggestions for 
needed research, is to be found in the Madow-Hyman-Jessen Report. Prepared by 
three eminent statisticians at the request of a congressional committee, this report has 
been summarized in one of the publications of the Advertising Research Foundation 
(73). 

Telephone inquiries are now a well-established method for evaluating the 
audience coverage of different programs. The survey may cover a designated 
period, such as the preceding day. In that case, the respondent is asked during 
what hours the radio or television was on and what programs were heard. He 
might also be asked who was listening to the program—how many men, 
women, and children. The replies to such a survey are obviously subject to 
errors of memory and other inaccuracies of report, despite the recency and 
brevity of the period covered. 

To avoid these difficulties, the coincidental technique was evolved. In a 
telephone inquiry, the respondents are asked whether the radio (or television) 
is on at the time and if so what program is tuned in and what is the name of 
the sponsor or product advertised. The calls are made to a random sample of 
telephone owners, drawn from the telephone directory of each city. This type 
of survey provides the data for the well-known "Hooperating" of radio and 
television programs, prepared by the C. E. Hooper organization. With suitable 
sampling procedures, the coincidental telephone technique can also be used on 
a much smaller scale to investigate the coverage of local programs (75). For 
the latter purpose, as few as 20 or 30 calls may yield data at a satisfactory 
level of statistical significance. 

A limitation of all telephone surveys arises from the fact that a sizable pro- 
portion of families own radios and television sets but have no telephone. In 
sample surveys conducted by the U.S. Census Bureau in 1960 and 1961, 
22 per cent of all households had no telephone, and, of these, 70 per cent 
owned one or more television sets (103, 104). In a telephone sample, the 
lower-income families will be underrepresented. Hence, the sampling loss is 
likely to be selective with regard to programs, since the relative popularity 
of different programs varies with socioeconomic level. The total extent of 
radio listening or television viewing may also be related to telephone owner- 
ship. One survey found that nontelephone families listened to the radio on the 
average 44 per cent more hours per day than did telephone families (5). 
Estimates of total coverage, as well as of the relative audience size for differ- 
ent programs, may thus be distorted through the use of telephone samples. 
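A rough worked example may show how large such a distortion could be. The Python sketch below combines two figures quoted above that come from different surveys (22 per cent of households without a telephone; nontelephone families listening 44 per cent more hours), so the result is an illustration of the arithmetic and not an estimate for any actual audience. The function name and the unit of one hour are assumptions.

```python
def population_mean_hours(phone_hours, no_phone_share, no_phone_ratio):
    """Average daily hours over all households.

    phone_hours     average hours in telephone households
    no_phone_share  proportion of households without a telephone
    no_phone_ratio  nontelephone hours as a multiple of telephone hours
    """
    phone_share = 1.0 - no_phone_share
    return phone_share * phone_hours + no_phone_share * no_phone_ratio * phone_hours

phone_hours = 1.0   # arbitrary unit: whatever a telephone sample would report
true_mean = population_mean_hours(phone_hours, no_phone_share=0.22, no_phone_ratio=1.44)
shortfall = 1.0 - phone_hours / true_mean

print(f"all households average {true_mean:.4f} units")
print(f"a telephone-only estimate falls short by {100 * shortfall:.1f} per cent")
```

Under these assumed inputs the telephone sample would understate total listening by somewhat under one-tenth. The bias in the relative standing of individual programs cannot be computed this way, since it depends on how program preferences differ between the two groups.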

Both mailed questionnaires and personal interviews have been employed in 
the effort to avoid the biases of telephone samples. Mail inquiries, however, 
are likely to yield a small proportion of returns, subject to a number of 
unknown selective factors. House-to-house interviews are regularly conducted in 
large metropolitan areas by a radio and television survey service known as 
The Pulse. Covering a 12-hour period, Pulse interviewers first ascertain during 
what times (if any) the respondent listened to the radio or watched television. 
They then show the respondent a roster of programs that were broadcast 
during each of these times, with the request that the respondent indicate those 
he heard or saw. Although obtaining a more representative sample than 
telephone surveys, this roster technique is subject to the limitations of all 
recognition tests. 

A technique designed to avoid both the sampling limitations of telephone 
and mail surveys and the memory errors of mail and interview procedures 
utilizes a mechanical recorder. Originally developed by Stanton (96) for use 
with radios, this technique has undergone many mechanical improvements 
and now forms the basis of the Nielsen Audimeter (76). The Audimeter is 
a small electronic device that is installed in the home and keeps a record of 
all times when radio or television sets are in use, as well as the stations to 
which they are tuned. As many as four radio or television sets can be simul- 
taneously covered by a single Audimeter. Once installed in a closet or other 
convenient place in the home, the device works silently and unobtrusively. 
New tape is inserted every two weeks, and the used tape is removed for 
analysis. 

To be sure, the Audimeter does not show which members of the family 
listen to or watch each program, nor whether anyone at all is in the room at 
the time. Some people do turn on a radio and then forget about it, walking 
out of the room or even going out of the house without bothering to turn it 
off. That this does not occur as often as might be supposed, however, was in- 
dicated by an analysis of Audimeter results which showed that the average 
radio-listening period without change of station was only 28 minutes. 
Only 5 per cent of the listening periods were over 1 hour long, and 56 per cent 
were under 15 minutes. 
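Summary figures of this kind are easily derived once the tape has been reduced to a list of uninterrupted listening periods. The following Python sketch assumes such a list, with each period given as a start minute, an end minute, and a station; the record format, the station call letters, and the four periods are hypothetical and are not taken from Audimeter data.

```python
from statistics import mean

def summarize_periods(periods):
    """Summarize listening periods without change of station.

    Each period is (start_minute, end_minute, station). Returns the mean
    length in minutes and the shares of short and long periods.
    """
    durations = [end - start for start, end, _station in periods]
    n = len(durations)
    return {
        "mean_minutes": mean(durations),
        "share_under_15": sum(d < 15 for d in durations) / n,
        "share_over_60": sum(d > 60 for d in durations) / n,
    }

# Hypothetical periods read from one home's tape.
periods = [
    (0, 10, "WAAA"),
    (10, 40, "WBBB"),
    (100, 105, "WAAA"),
    (200, 275, "WCCC"),
]

summary = summarize_periods(periods)
print(summary)
```

A large share of very long periods would suggest sets left on and forgotten; a preponderance of short ones, as in the analysis cited, argues against that interpretation.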

Audience Reaction. In the effort to obtain an objective record of audience 
preferences, Stanton and Lazarsfeld developed the Program Analyzer (see 41, 
81). Representative samples of subjects, drawn from groups of volunteers, 
meet in a studio to listen to or view a program. Each subject is given a red and 
a green pushbutton and is instructed to press the green button during the parts 
of the program he likes and to press the red button during the parts he 
dislikes; when he is indifferent, he presses neither button (see Figure 74). Each 
red and green button is connected with pens that provide a continuous record 
on tape. Since time is simultaneously recorded on the tape, the parts of the 
program that each subject liked or disliked can be identified. After the ob- 
servation period, subjects are usually questioned regarding the reasons for 
their likes and dislikes. 

Figure 75 shows typical results obtained with the Program Analyzer. The 
height of the graph above and below zero indicates the number of subjects 
who liked or disliked each part, respectively. The trend line provides a single 
index that combines frequencies of liking, disliking, and indifference reactions 
for the group. This graph was taken from a comparative study of audience 
preferences in New York, Boston, and Hollywood (81). The results proved 
to be quite similar and suggest that tests conducted in the New York studio 
may provide a reliable index of national audience reactions, at least for certain 
types of programs. 
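The tallying that underlies such a profile can be sketched in Python. The coding of each subject's record as a string of `L` (green button), `D` (red button), and `-` (neither) per time interval is a hypothetical convenience. The net score computed here, likes minus dislikes, is an assumed stand-in only; the source does not give the formula for the trend line.

```python
def profile(reactions):
    """Tally reactions per time interval across subjects.

    `reactions` holds one string per subject, one character per interval:
    'L' = like, 'D' = dislike, '-' = indifferent (hypothetical coding).
    Returns a list of (likes, dislikes, likes - dislikes), one per interval.
    """
    n_intervals = len(reactions[0])
    rows = []
    for t in range(n_intervals):
        column = [subject[t] for subject in reactions]
        likes = column.count("L")
        dislikes = column.count("D")
        rows.append((likes, dislikes, likes - dislikes))
    return rows

# Hypothetical records for four subjects over four intervals.
reactions = ["LL-D", "L-DD", "LLL-", "-LDD"]

for t, (likes, dislikes, net) in enumerate(profile(reactions), start=1):
    print(f"interval {t}: {likes} liked, {dislikes} disliked, net {net:+d}")
```

Plotting the likes above zero and the dislikes below zero for each interval would give a graph of the general form described above.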

Some research has also been done on radio and television commercials. The 
Schwerin Research Corporation, for example, has used rough, inexpensive film 
versions of television commercials in pretests of audience recall. Follow-up 
studies showed that the percentages of persons who remembered the brand 
name and at least one sales point were practically identical for the rough 
version and the finished version (92). 

Fig. 75 Profile of listener reactions to a radio program investigated with 
the Program Analyzer. (From Peatman and Hollonquist, 81, p. 273.) 

Preferences for different types of programs, such as classical music, news, 
and soap operas, have been investigated through personal interviews of 
representative national samples (57). These preferences have been analyzed with 
reference to age, sex, educational level, and other demographic variables. They 
have also been studied with reference to reading habits, attendance at motion 
pictures, participation in social activities, and personality traits (38, 57). Special 
studies of the content of soap operas and of the characteristics of their 
audiences suggest that many listeners regard the soap opera as a source of advice 
in solving their own problems, although the "advice" is usually superficial and 
unrealistic (3, 38). Mention should also be made of the use of depth 
interviewing and projective techniques in more intensive qualitative investigations 
of the emotional reactions of audiences to radio and television programs (28). 
These studies represent an application to radio and television of the techniques 
of motivation research to be discussed in Chapter 11. 

Sales Effectiveness of Radio and Television Advertising. Well-controlled 
studies of the effect of radio or television advertising are few. Generally 
these studies utilize some adaptation or combination of sales tests and brand 
use surveys. An outstanding example is a study reported by Stanton (97) on 
the effect of a radio program that had been broadcast for six months over a 
limited chain. By checking dealers’ inventories and sales records, the 
investigators compared sales in two areas that were similar with regard to population, 
number of retail outlets, and previous sales of the product studied. In one 
area, however, the product was advertised by a radio program; in the other it 
was not. The survey showed 88 per cent more sales in the radio area than in 
the nonradio area. In the same study, households in the radio area were 
surveyed by the coincidental telephone technique to discover who was listening to 
the program. Brand use by these families was later ascertained through pantry 
inventories conducted in each home by field investigators. The results revealed 
that among the listeners the brand advertised on the program was used by 
81 per cent more families than the next most popular brand; among the non- 
listeners, it was used by only 7 per cent more. 

A study of the sales effectiveness of television advertising was conducted in 
Fort Wayne, Indiana, by the National Broadcasting Company (24). Field in- 
terviewers visited 7,500 households in the fall of 1953, before television was 
introduced to that city, and again in the spring of 1954, after one Fort Wayne 
station had been on the air for about five months. A total of 24 brands that 
were advertised on TV were analyzed with regard to purchases on the two 
occasions. In TV households, reported purchases of these brands rose by 6.0 
per cent; in non-TV households, they rose by only 1.7 per cent (24, pp. A-15, 
A-16). The excess of purchases in TV households was larger for brands that 
devoted more time to TV advertising than for those devoting less time to it. 

When brands advertised on TV were compared with those not advertised on 
TV, more striking differences were obtained. Among TV set owners, 
purchases of Scotties, a brand advertised on TV, rose from 15 to 40 per cent; in 
the same group, purchases of Kleenex, not advertised on TV, dropped from 
35 to 32 per cent (24, p. 27). With the coming of television, those brands 
advertised on TV also showed marked improvements in measures of brand 
awareness, brand-product association, trademark recognition, slogan identifica- 
tion, brand reputation, and brand preference. Brands not advertised on TV, 
included in the questionnaire for control purposes, showed negligible or even 
negative changes in these measures over the same period (24, p. A-8). 


PERSONAL SALESMANSHIP 


Salesmanship may be regarded as one form of advertising. Its purpose, like that 
of all advertising, is to show how a consumer want can be satisfied by a cer- 
tain product or service. It is the most flexible form of advertising because it 
can be adapted fully to the needs, wants, and characteristics of each customer. 
The general approach, language, specific selling points, and all other aspects 
of the sales process can be chosen to suit the individual prospect. No other 
form of advertising permits this degree of individual adaptation. Moreover, 
the salesman can modify his approach as he observes the customer's reaction. 
The face-to-face sales situation provides immediate feedback to guide the 
course of the sales interview. 

Books on salesmanship are plentiful. There is no shortage of opinions on 
how to be an effective salesman, but facts are scarce. Most books on salesman- 
ship have been written by nonpsychologists. One of the few exceptions is a 
book by Husband (44). Although not based on research findings, this book 
provides a competent treatment of the subject that is in line with general 
psychological knowledge and contains many useful suggestions. More recently, 
Cash and Crissy (22) have tried to incorporate relevant facts and principles 
of psychology into a series of short, popularly written books for salesmen. Four 
volumes of this series have been published, and several more are in preparation. 

Generalizations about salesmanship in the abstract are likely to be of little 
value because of the many varieties of selling. The wholesale salesman, the 
door-to-door salesman, and the salesclerk in a retail store are obviously 
performing different functions. Within any of these types of selling, moreover, 
problems and techniques vary widely with the nature of the product to be 
sold and the characteristics of the customers. The sex, age, education, and 
socioeconomic level of customers in a particular market will partly determine 
the sort of sales methods—and salesman—that will succeed. Similarly, an 
excellent shoe salesman may be a failure in selling electronic computers or life 
insurance. The fact that several different sales keys have been developed for 
scoring the Strong Vocational Interest Blank illustrates the specificity of sales- 
man qualifications (see Chapter 3). Even for what appear to be closely related 
sales jobs, the interest patterns reflected in these scoring keys may be quite 
dissimilar (see 43). 

Rather than trying to evolve generalized principles or “formulas” of 
effective salesmanship, psychologists have concentrated on the selection and 
training of salesmen for specific jobs. These were among the first problems that 
occupied such pioneers in applied psychology as Walter Dill Scott and Walter 
V. Bingham at the Carnegie Institute of Technology. Within Carnegie In- 
stitute’s Division of Applied Psychology, which was organized in 1915, the 
Bureau of Salesmanship Research was established in 1916 and the Research 
Bureau for Retail Training in 1918 (see 31). The most promising technique 
for salesman selection has proved to be the empirically developed, weighted 
application blank (see Chapter 4). For high-level sales personnel, the selection 
procedures should include an intensive guided interview and a battery of tests 
chosen to fit the specific job. Attempts to devise general tests of “sales aptitude” 
have met with little success. This finding is not surprising in view of the 
specificity of qualifications for different sales jobs. 

Sales training is receiving increasing attention. The initial on-the-job 
training period may vary from a few days of orientation to over a year of technical 
preparation. Refresher courses and periodic sales conferences are also a com- 
mon practice in many companies. One objective of sales training is to acquaint 
the salesman with his product. The salesman needs up-to-date knowledge re- 
garding the special characteristics of his product, its possible uses, and the 
comparative features of competing brands. This information provides him with 
concrete facts that he may use in his sales talk and in his answers to customer 
questions. With regard to the human-relations aspects of sales training, the 
objectives and procedures are quite similar to those of supervisory training (see 
Chapter 6). The various forms of role playing developed for use in super- 
visory training are especially suitable for salesmanship training (33). 

Research on the sales process itself is very meager. A few laboratory studies 
have utilized ratings and other qualitative judgments of selected features of 
simulated sales interviews (e.g., 30, 67, 74). Although suggesting leads for 
further investigation, such studies are limited by their artificial conditions and 
subjective evaluations. Field studies of salesmen yield more realistic data, al- 
though conditions are difficult to control in such studies. 

One study was concerned with the contribution of communication skills in 
sales effectiveness (79). From a group of women engaged in part-time door- 
to-door selling for a nationwide sales organization, 20 “more effective” and 
17 “less effective” saleswomen were chosen on the basis of their sales records. 
An interviewer who had no knowledge of the subject’s sales record evaluated 
each in selected aspects of oral communication, such as use of voice and 
language, listening behavior, utilization of emotional appeals, and 
dramatization. Several significant differences were found between the two groups, 
suggesting the desirability of considering communication skills in selection and 
training programs for this type of selling. 

Some provocative results were obtained in a field survey of driver-salesmen 
for a large company operating on a national scale (107). When such ex- 
traneous factors as geographic location and store differences were ruled out, 
sales records proved to be more closely associated with “sales methods” than 
with “sales personality.” The men described by the field observers as “negative, 
colorless, weak, and drab,” for example, were getting the same average size 
of order per stop as those described as “genial, cordial, familiar, easygoing, 
backslapping.” On the other hand, those who began each call by making a 
count of the dealer's stock of the company's products received orders that 
averaged 2⁄4 times as large as the orders obtained by salesmen who did not 
make this their first step. Similarly, those who told the dealer about the week's 
special offer before taking the regular order averaged 27 per cent more sales 
per call than did those who wrote up the regular order first. 

The critical-incident technique (see Chapter 2) offers a promising approach 
to the analysis of selling behavior. This technique was used in a study con- 
ducted in the Minnesota Mining and Manufacturing Company, which em- 
ploys over 1,000 salesmen (54). Each of 85 sales managers was asked to submit 
as many critical incidents as he could to illustrate both effective and ineffective 
behavior observed among his group of salesmen. A total of 96 usable incidents 
was reported, including 61 examples of effective behavior and 35 examples of 
ineffective behavior. The incidents fell into the following 15 categories (54, 
p. 56): 

1. Following up: (a) complaints; (b) requests; (c) orders; (d) leads 

2. Planning ahead 

3. Communicating all necessary information to sales managers 

4. Communicating truthful information to managers and customers 

5. Carrying out promises 

6. Persisting on “tough” accounts 

7. Pointing out uses for other company products besides the salesman's own line 

8. Using new sales techniques and methods 

9. Preventing price cutting by dealers and customers 

10. Initiating new sales ideas 

11. Knowing customer requirements 

12. Defending company policies 

13. Calling on all accounts 

14. Helping customers with equipment and displays 

15. Showing nonpassive attitude 


The largest number of incidents involved some form of follow-up. Persisting 
with tough customers and planning were also mentioned frequently. Although 
the factors cited differ in weight, failure with respect to any one of them 
could lead to failure on the entire job. The factors mentioned and their relative 
importance would of course vary somewhat with the type of selling, the prod- 
uct sold, and the company. Follow-up, for example, would undoubtedly be a 
less prominent factor for a retail salesclerk in a store than it was in this group 
of salesmen. 

Studies such as these lend support to a concept of salesmanship that is 
rapidly gaining ground today. This is a concept based on customer-oriented 
selling, two-way communication, and service. An important aspect of the 
salesman’s task is to learn all he can about the needs and wants of the individual 
customer. In wholesale and technical selling, this may entail extensive 
preparation and thorough study of individual companies. For the retail salesclerk, 
it means asking relevant questions and listening to the customer. It means, too, 
that the salesman should avoid such practices as jumping to conclusions, 
“sizing up” the customer prematurely, trying to fit every customer into a 
stereotype, or high-pressuring the customer into buying something he does not 
want. The last-named practice represents shortsighted selling, which leads 
more often to the return of merchandise than to the return of customers. 

The current emphasis on product information in salesmen training programs 
provides the salesman with a store of knowledge from which he can choose 
relevant and useful facts to communicate to each individual customer. These 
concrete facts are of more interest to the customer than is the routine repeti- 
tion of such empty phrases as “wonderful quality,” “colossal bargain,” and “a 
great buy.” Finally, it is more important for the salesman to ask himself in 
what ways he can be of service to each customer than to try to change his 
own personality. With regard to “sales personality,” there seems to be room for 
wide individual differences in successful selling. Each individual should try 
to utilize his unique pattern of traits to best advantage, rather than trying to 
follow someone else's rules for getting along with people. 


CONSUMER OPINION SURVEYS 


Purposes of Opinion Surveys. It is now generally recognized that the 
development and marketing of consumer goods and services requires two-way 
communication. It is wasteful for the manufacturer to devise a product that he 
thinks consumers want and then use advertising and selling techniques to try 
to convince consumers that they do want it. A more effective procedure is to 
investigate consumer wants first and then develop products and services to meet 
those wants. It is better to avoid sales resistance than to overcome it. 

Consumer surveys may be used as a guide in designing new products, in 
modifying existing products, or in preparing advertisements for products al- 
ready on the market. The utilization of consumer research in product design 
may be regarded as an extension of engineering psychology (see Chapter 9). 
Consumer survey data may help not only in developing the product itself but 
also in designing the package or in choosing a trade name (see, e.g., 15). For
the advertiser, consumer research provides information on those features of the 
product that are most important to the consumer. Do people who own electric 
clocks like them chiefly because of their high level of accuracy, because of 
their dependability, or because they do not require daily winding and setting? 
Consumer surveys may also reveal new uses for a product. When Kleenex was 
first developed, it was marketed as a facial tissue to remove cold cream. Its 
makers later learned, however, that many purchasers used it as a handkerchief. 
Accordingly, subsequent advertising focused on the merits of Kleenex for this 
purpose, stressing the fact that as a handkerchief it is cheap and sanitary and 
requires no laundering. The result was a sharp and continuing rise in sales, 
which made possible repeated reductions in price (36, p. 187; 77). 

The procedures followed in conducting consumer surveys are essentially the
same as those employed in public opinion polls. Although the preelection polls
are undoubtedly the best-known examples, public opinion polls are regularly 
conducted to ascertain prevailing opinions on many political, economic, and 
social questions. The information gathered in such surveys over the years 
provides a valuable pool of raw data for social science research (see 19, 21, 88). 
Public opinion polls and consumer surveys are often handled by the same 
organizations. Many are conducted by commercial survey organizations, such 
as those established by George Gallup or by Elmo Roper. Certain universities 
have research bureaus that specialize in this type of survey, such as the Survey 
Research Center at the University of Michigan and the Bureau of Applied 
Social Research at Columbia University. Public opinion polls are regularly 
conducted by certain government agencies, notably the Department of Agricul- 
ture. Consumer surveys are also carried out by some advertising agencies and 
by large manufacturing companies. 

Sampling Problems. Even when based on impressively large samples, 
opinion surveys may yield totally misleading results because of either poor 
methodology or a deliberate attempt to support a foregone conclusion. It is
not difficult to find examples of surveys rendered worthless by inadequately 
trained or unscrupulous investigators. The best protection against such mis- 
representations is a knowledge of how to evaluate an opinion survey. The 
basic problems, pitfalls, and procedures of opinion surveys have been fully 
discussed in several publications that should be consulted for further details 
(e.g., 12-14, 20, 70, 80, 86, 101).

Once we have decided what we want to ask in a survey, we need to de- 
termine whom to ask and how to ask it. The first question concerns sampling. 
Although there are several kinds of sampling procedure, a basic distinction 
is that between random and stratified sampling. A random sample is one in 
which each individual in the given population has an equal chance of being 
chosen and each choice is independent of every other choice. It could be 
illustrated by taking every tenth name in an alphabetical list of all names in 
the population. The traditional practice of “pulling a name out of a hat”
accomplishes the same purpose.

A stratified sample is a miniature replica of the total population with 
respect to chosen variables. For example, if a sample is to be stratified with 
regard to education, the proportion of persons in each educational level, or 
“stratum,” must be the same as that in the total population. If 8 per cent of 
the given population are college graduates, then 8 per cent of the persons in- 
cluded in the sample must be college graduates, and so on. The sample may 
be stratified separately with regard to several variables, such as age, sex, 
education, income level, and geographical location. In this case, the distribu-
tion of each of these variables in the sample would match its distribution in
the population. A more precise procedure is to stratify the sample for all 
chosen variables simultaneously so that each cell in the cross-classification 
contains the correct proportion of persons. One cell of such a sample, for in- 
stance, might call for five female high school graduates between the ages of 
thirty and thirty-five. Because this procedure is very time-consuming, most 
surveys rely on the simpler type of stratification with no cross-classifications. 
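
The difference between the two procedures can be sketched in a few lines of Python. This is only an illustration under stated assumptions: the population of 10,000 persons, the two educational strata, and the function names are hypothetical, and only the 8 per cent figure is taken from the example above.

```python
import random
from collections import Counter

def simple_random_sample(population, n, rng):
    """Draw n persons so that every person has an equal chance of selection."""
    return rng.sample(population, n)

def stratified_sample(population, n, stratum_of, rng):
    """Draw a proportionate stratified sample: each stratum's share of the
    sample matches its share of the population (up to rounding)."""
    strata = {}
    for person in population:
        strata.setdefault(stratum_of(person), []).append(person)
    sample = []
    for members in strata.values():
        # Quota for this stratum; rounding may shift the total by a case or
        # two when the shares do not divide evenly.
        quota = round(n * len(members) / len(population))
        sample.extend(rng.sample(members, quota))
    return sample

# Hypothetical population: 8 per cent college graduates.
population = [("grad", i) for i in range(800)] + [("nongrad", i) for i in range(9200)]
rng = random.Random(0)  # fixed seed so the sketch is repeatable

srs = simple_random_sample(population, 500, rng)
strat = stratified_sample(population, 500, lambda person: person[0], rng)

print(Counter(p[0] for p in srs))    # graduate count varies from draw to draw
print(Counter(p[0] for p in strat))  # graduate count fixed by the quota
```

In the stratified draw the number of graduates is fixed in advance at 8 per cent of the sample, whereas in the random draw it fluctuates around that value.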

A number of surveys employ area sampling, in which small geographic units 
are sampled, rather than persons. In national surveys, this procedure requires
the sampling of towns and rural areas. When applied within a city, it involves
the sampling of blocks or other small units. Each interviewer is then given 
specific addresses at which he must call. Area sampling may itself be random, 
stratified, or a combination of both types. 

Sample size is not as important as representativeness. Through the use of 
stratified samples it is possible to obtain precise estimates of population data 
with fewer cases than are needed with random sampling. Most national surveys 
now use between 3,000 and 5,000 cases. The percentages of each response 
obtained in such samples have sufficiently small sampling errors for most 
purposes. 
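
The gain in precision from stratification can be illustrated with the usual textbook formulas for the standard error of a proportion. The sketch below assumes proportionate allocation, ignores any finite-population correction, and uses two hypothetical strata whose shares and proportions are invented for the purpose.

```python
import math

def se_simple_random(p, n):
    """Standard error of a proportion from a simple random sample of size n."""
    return math.sqrt(p * (1 - p) / n)

def se_proportionate_stratified(strata, n):
    """Standard error of the overall proportion under proportionate
    stratification. `strata` is a list of (population_share, proportion) pairs."""
    within = sum(share * p * (1 - p) for share, p in strata)
    return math.sqrt(within / n)

# Hypothetical strata: (share of population, proportion answering "yes").
strata = [(0.30, 0.70), (0.70, 0.35)]
overall_p = sum(share * p for share, p in strata)

n = 3000
se_srs = se_simple_random(overall_p, n)
se_strat = se_proportionate_stratified(strata, n)

# Cases a simple random sample would need to match the stratified precision.
n_equivalent = n * (se_srs / se_strat) ** 2
print(round(se_srs, 4), round(se_strat, 4), round(n_equivalent))
```

The more the strata differ from one another on the question asked, the larger the saving; when the strata do not differ at all, the two standard errors coincide.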

By computing sampling errors, the investigator can determine the range 
within which the corresponding population percentages are likely to fall, as 
well as the significance of the differences between obtained percentages. For 
instance, he might find that 35 per cent of the respondents in the sample are 
dissatisfied with the refrigerator service provided by Company A. By computing 
the standard error of this percentage, he finds that the percentage of such dis- 
satisfied customers in the population from which the sample was drawn may 
range from 33 to 37. By the same statistical procedures, he may establish that 
the percentage of complaints is significantly smaller for the refrigerator manu- 
factured and serviced by Company B than it is for Company A's refrigerator. 
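
These computations can be sketched as follows. The sample sizes of 2,000 and the 30 per cent figure for Company B are assumptions chosen for illustration; only the 35 per cent figure comes from the example above. The range is taken as roughly two standard errors on either side of the obtained percentage.

```python
import math

def standard_error_pct(pct, n):
    """Standard error of a sample percentage, in percentage points."""
    p = pct / 100
    return 100 * math.sqrt(p * (1 - p) / n)

def interval(pct, n, z=2.0):
    """Range of z standard errors on either side of the sample percentage."""
    se = standard_error_pct(pct, n)
    return pct - z * se, pct + z * se

def z_for_difference(pct_a, n_a, pct_b, n_b):
    """z ratio for the difference between two independent percentages,
    using the pooled percentage to estimate the standard error."""
    pooled = (pct_a * n_a + pct_b * n_b) / (n_a + n_b)
    p = pooled / 100
    se_diff = 100 * math.sqrt(p * (1 - p) * (1 / n_a + 1 / n_b))
    return (pct_a - pct_b) / se_diff

# Hypothetical sample size of 2,000 respondents for Company A.
low, high = interval(35, 2000)
print(round(low), round(high))

# Hypothetical result for Company B: 30 per cent dissatisfied, n = 2,000.
z = z_for_difference(35, 2000, 30, 2000)
print(abs(z) > 1.96)  # True indicates a difference significant at the .05 level
```

With a sample of this assumed size, the range works out to about 33 to 37 per cent, in line with the figures quoted in the text.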

Although a certain minimum sample size is required for reliable results, 
further increases in number of cases are generally wasteful. Moreover, large 
samples provide no guarantee that the results are accurate. One of the most 
famous examples of this point is the Literary Digest poll, based on the replies
of over 2 million cases (see 2). This poll, which in 1936 predicted the election
of a presidential candidate who was actually defeated by an overwhelming 
majority, highlights two faulty sampling procedures. First, the sampling was 
biased, since the mailing lists were obtained from telephone directories and 
automobile registrations. These sources yielded a sample that overrepresented 
higher-income groups and underrepresented lower-income groups. Second, the 
proportion of nonrespondents was high. Although there were over 2 million
respondents, the number of ballots sent out was 10 million.
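
The first of these faults can be imitated in a small simulation. Every figure below is hypothetical: an electorate of 100,000, assumed levels of support in two income groups, and assumed chances of appearing on a mailing list. The sketch does not reproduce the 1936 data, and it leaves nonresponse out of account.

```python
import random

rng = random.Random(1936)  # fixed seed so the sketch is repeatable

# Hypothetical electorate: (number of voters, support for candidate X,
# chance of appearing on the mailing list) for each income group.
GROUPS = {"higher": (30_000, 0.70, 0.90), "lower": (70_000, 0.35, 0.20)}

voters = []
for group, (count, p_support, p_listed) in GROUPS.items():
    for _ in range(count):
        favors_x = rng.random() < p_support
        listed = rng.random() < p_listed
        voters.append((favors_x, listed))

def share_for_x(sample):
    """Proportion of the sample favoring candidate X."""
    return sum(favors_x for favors_x, _ in sample) / len(sample)

true_share = share_for_x(voters)

# Very large sample drawn only from the biased mailing list.
mailing_list = [v for v in voters if v[1]]
biased_sample = rng.sample(mailing_list, 20_000)

# Much smaller sample drawn at random from the whole electorate.
random_sample = rng.sample(voters, 1_000)

print(f"true share          {true_share:.3f}")
print(f"biased, n = 20,000  {share_for_x(biased_sample):.3f}")
print(f"random, n = 1,000   {share_for_x(random_sample):.3f}")
```

Under these assumptions the small random sample comes much closer to the true share than the large sample drawn from the biased list, because added cases reduce only the sampling error and leave the bias untouched.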

Considerable research is now available on the effects of nonresponse bias,
or volunteer error, upon the representativeness of samples (6, 18, 27, 42).
This research demonstrates that survey respondents differ from nonrespondents 
in a number of characteristics that may be correlated with the answers they 
give. Similarly, persons who volunteer for any sort of investigation, whether 
in the laboratory or in the field, differ from those who do not volunteer. 
Differences have been found both in demographic variables, such as age, sex, 
occupation, and income level, and in personality characteristics, such as self- 
esteem, conventionality, and emotional adjustment (see 6). The specific dif- 
ferences undoubtedly vary with situational factors, such as the nature of the 
experiment, the topics covered in the survey, the sponsoring organization and 
the data-gathering procedures.

In house-to-house interviews, some of the nonrespondents are persons who 
were not at home when the interviewer called. Unless proper precautions are 
taken, a daytime survey will underrepresent employed persons and will over-
represent elderly, retired persons and mothers of young children. Other non-
respondents are those who refuse to cooperate when contacted. Still a different
situation is involved when a general call is issued for volunteers to participate
in a laboratory study. With regard to mailed questionnaires, it has been found 
that those who respond are more likely to be favorably disposed toward the 
company or product than those who fail to respond. 

Several procedures have been proposed for handling nonresponse bias. Re- 
peated follow-ups may be employed to increase the proportion of respondents. 
A random sample of the nonrespondents in a mail survey may be personally 
interviewed. The demographic characteristics of the respondents may be com- 
pared with those of the total population or with the specifications originally 
laid down for the stratified sample. Or the nature and extent of nonresponse 
bias in the results may be estimated by comparing the replies obtained from 
early and late respondents or from those who responded to the initial request 
and those who responded to the follow-up. These findings may then be extra- 
polated to those who failed to respond at all. 
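
The last of these procedures can be sketched as a simple weighted average. The figures are hypothetical, and the key step rests on an assumption rather than a finding: that persons who never reply resemble those who replied only after a follow-up.

```python
def pooled_pct(waves):
    """Percentage favorable among everyone in `waves`.
    Each wave is a (number_of_persons, pct_favorable) pair."""
    total = sum(n for n, _ in waves)
    return sum(n * pct for n, pct in waves) / total

# Hypothetical mail survey of 1,000 persons.
initial = (400, 70.0)     # replied to the first mailing
follow_up = (200, 55.0)   # replied only after a follow-up
n_silent = 400            # never replied

# Unadjusted estimate: respondents only.
unadjusted = pooled_pct([initial, follow_up])

# Assumption: nonrespondents resemble the late respondents.
adjusted = pooled_pct([initial, follow_up, (n_silent, follow_up[1])])

print(round(unadjusted, 1), round(adjusted, 1))
```

If late respondents are less favorable than early ones, as in this invented case, the adjusted estimate falls below the unadjusted one; other extrapolation rules, such as projecting the trend across waves, would give somewhat different figures.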

Questionnaire Construction. A second major aspect of opinion-survey 
methodology is questionnaire construction. Poorly formulated questions can 
play as much havoc with survey results as can improper sampling procedures.
The wording of questions and the general appearance and layout of the ques- 
tionnaire are of primary importance in surveys conducted by mail. Questions 
used in telephone or house-to-house surveys, however, also need to be carefully 
formulated in advance. 

The question writer can profitably examine the extensive research literature 
on the effects of different question forms upon replies (see, e.g., 11, ch. 7; 14; 
20; 46; 56; 85). Acquaintance with the findings of this research will alert
the investigator to certain common pitfalls he should avoid. The trend of 
opinion revealed in survey replies can be significantly altered and even re- 
versed by changing the grammatical form of the question, the number of 
response options provided, or the order in which questions are presented (20, 
ch. 2). In a questionnaire on cooking ranges, significantly different percentages 
of preferences were obtained when alternatives were presented verbally or 
pictorially (105). The inclusion of emotionally toned terms such as “liberal”
or “democratic” or the insertion of the name of a prominent person in an
opinion questionnaire can substantially affect results.

Leading questions should of course be avoided. But some leading questions 
may appear innocuous to the investigator. Ambiguities are also difficult to de- 
tect, however well informed and experienced the writer may be. Since the 
writer knows what he wanted to say, he can rarely perceive other possible 
meanings in his question. Although much can be done in advance to improve 
the formulation of questions, pretesting provides the only sure way to iron out 
all difficulties. The questions should be pretested on persons typical of the 
respondent populations for which the survey is designed. Several revisions may 
be required to develop a satisfactory questionnaire. 

The social desirability variable discovered in personality test responses (see
Chapter 3) also affects the answers given in consumer surveys. The tendency
to give what the respondent perceives as the socially more desirable answer
is particularly strong in face-to-face interviews (20, ch. 5). If people are asked
directly what magazines they read, for example, the replies will grossly over-
estimate the readership of such prestige magazines as the Atlantic Monthly
and will show negligible readership for the pulp magazines, which actually sell
by the millions (23, 60). For this reason, magazine and newspaper readership
surveys employ indirect and check questions to ascertain whether or not the 
respondent has read the latest issue of a particular medium. 

The growing body of research on individual response styles in questionnaire 
replies is highly relevant to consumer surveys. Such studies have revealed 
consistent individual differences in response styles, which may be correlated 
with other behavioral characteristics of the respondents. For instance, in-
dividuals differ in their tendency to answer "Yes" or "No" when asked whether
they use a certain product, read a particular magazine, or look at a TV 
program (106). The “yea-sayers” are also more likely to express a clear-cut 
preference between two articles than are the "nay-sayers" and to give more 
favorable ratings to products, programs, and other items on which opinions are 
solicited. Insofar as these response styles reflect basic personality differences, 
they may serve as useful predictors of other behavior, such as impulse buying, 
susceptibility to persuasion, or preference for certain media. From another 
angle, however, a disproportionate number of yea-sayers or nay-sayers in a 
group could distort results in a variety of ways (see 106). Either an in-
dependent measure of this response tendency should be obtained for each
subject, or questionnaires should be formulated so as to minimize its operation. 
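
One simple form such an independent measure might take is the proportion of "Yes" answers a respondent gives to a set of unrelated items. The sketch below is hypothetical throughout: the replies are invented, and the cutoffs of .3 and .7 are arbitrary illustrations rather than values drawn from the research cited.

```python
def yea_saying_index(answers):
    """Proportion of "Yes" answers across a set of unrelated yes/no items."""
    return sum(1 for a in answers if a == "Yes") / len(answers)

def label(index, low=0.3, high=0.7):
    """Classify a respondent by illustrative cutoffs; a real study would
    set the cutoffs empirically."""
    if index >= high:
        return "yea-sayer"
    if index <= low:
        return "nay-sayer"
    return "neither"

# Hypothetical replies of three respondents to the same ten unrelated items.
replies = {
    "R1": ["Yes"] * 9 + ["No"],
    "R2": ["Yes"] * 5 + ["No"] * 5,
    "R3": ["Yes"] + ["No"] * 9,
}

indices = {who: yea_saying_index(a) for who, a in replies.items()}

for who, index in indices.items():
    print(who, index, label(index))
```

An index of this kind could then be examined alongside each respondent's product ratings to see whether favorable ratings merely reflect a general tendency to agree.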

Interviewer Variables. In telephone and face-to-face surveys, the selec-
tion, training, and supervision of field interviewers represent an important
phase of the project. Survey results may be influenced in many ways by the 
interviewer's appearance and behavior. The interviewer's race, sex, and socio- 
economic level have been found to affect responses to questions pertaining to 
race relations, sexual behavior, and labor or economic issues, respectively (see 
4; 20, ch. 8; 48, ch. 7). 

The interviewer's own opinion on the question under investigation likewise 
affects results (20, pp. 107-112). It is interesting to note that when results 
have been analyzed with regard to community size, interviewers’ opinions have 
been found to affect the distribution of responses in small towns and rural 
areas, but not in large cities (20, p. 112). Among the possible reasons for this 
difference, the investigators pointed out that in smaller communities inter- 
viewers are more likely to be acquainted with their respondents. This fact may 
operate in two ways. First, interviewers consciously or unconsciously tend to 
choose respondents who share their views. Second, the respondents are more 
likely to know the interviewer's views and hence express an opinion that 
agrees with his. 

Anything in the interviewer's appearance, dress, manner, or comments that 
tends to associate him with a certain point of view may bias interview results. 
This is true even when the cue may be false, as in the case of popular 
stereotypes. In a survey on the use of hard liquor, two interviewers happened 
to look like the popular stereotype of a "prohibitionist." The samples inter- 
viewed by these two investigators reported much less use of liquor than did 
those questioned by the other interviewers (94). 

Respondents are also influenced by the interviewer's behavior during the 
interview. Any indication of agreement or disagreement by gesture, facial 
expression, or comment may affect subsequent responses. An experiment on 
verbal reinforcement was conducted in the form of a telephone survey of public 
opinion (39). Interviewers in one group were instructed to say “Good” when 
the respondent expressed agreement with the question, those in another group
to say "Good" when he expressed disagreement. In two other groups, "Mm-
hmm" was used as a reinforcement in the same ways. The response "Good"
proved to bias the results significantly, while “Mm-hmm” did not. 

When interviewer bias affects the respondent's answer—through appearance, 
behavior, or known opinion of the interviewer—the effect may be regarded as 
a special instance of the social desirability variable that operates in self-report 
techniques generally. Social desirability may be defined in terms of different 
groups with which the respondent identifies. There are very broad norms
with regard to the moral and lawful behavior of civilized society; there are 
narrower national, regional, or class norms with reference to many questions 
of group relations, economic policy, patriotism, and the like; and there are 
still narrower norms established by the characteristics and behavior of the 
individual interviewer. "Interviewer bias" in the replies of respondents may 
thus be regarded in part as the subject's effort to give the "right answer" as 
defined by the immediate situation. Even with an interviewer who is a total 
stranger and whom they never expect to see again, most people are reluctant 
to "make a bad impression." 


PRODUCT TESTING 


Brand Identification and Brand Preference Studies. It has often been 
asserted that different brands of many consumer goods are indistinguishable, 
once the brand labels and other artificial cues are removed. So-called “blind- 
fold” tests require careful methodological controls. Otherwise, either positive 
or negative results may be spurious. It is essential, of course, that all extraneous 
cues be effectively ruled out. For instance, touch cues might facilitate the 
identification of some brands of cigarettes. On the other hand, negative results 
need not indicate lack of discrimination (see 59, 102). Subjects may be unable 
to name brands correctly—especially in the case of obscure brands whose 
names are unfamiliar—even though they can discriminate the taste in a paired 
comparison test and may consistently prefer one of the brands. Another meth- 
odological problem arises from the presentation of several brands during the 
same experimental session. The procedures followed to eliminate sensory adap- 
tation, prevent mixture of sensations, and otherwise ensure independence of 
observations may be inadequate or may themselves introduce cumulative 
effects.

Product testing provides an opportunity to apply the psychophysical 
methods of the psychology laboratory to consumer research. Both laboratory 
and field studies have been conducted on brand identification and brand 
preference with such commodities as cigarettes, cola beverages, shaving creams, 
bread, and beer (see 32, 59, 102). Two investigations are of particular interest 
because of their contributions to methodology. 

One study was concerned with the ability to discriminate among three 
popular brands of cigarettes (59). The subjects were 246 college students, all 
chosen because they were habitual smokers, reported that they smoked one of 
the three brands under investigation, and actually had a package of that brand 
with them at the time of the test. The subjects, of course, did not know what 
brands were being used in the experiment. Tactual and visual cues were 
eliminated by covering the upper half of each cigarette with a gummed paper
band. Each subject was asked to smoke only one cigarette, the three brands
being used with different subjects in random order. The experimenter was in 
ignorance of the brand smoked by any one subject until that subject had left 
the room. Half of the subjects were asked to state whether they liked or dis-
liked the cigarette; the other half were asked whether they thought it was or
was not the brand they habitually smoked. 

Both types of judgments were made with better than chance accuracy. In 
other words, subjects demonstrated significant ability to identify or differentiate 
their own brand. They also preferred their own brand significantly more often 
than the other two. To be sure, the differences were not large, and many 
“errors” occurred. But the fact that the discriminations were significantly 
better than chance suggests that the procedures followed in this experiment 
may be more sensitive than the brand-naming techniques, which have usually 
yielded negative results. 
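
What "better than chance" means can be made concrete with an exact binomial computation. The sketch rests on two assumptions not given in the study report: that a two-choice judgment has a chance accuracy of .5, and that 74 of the 123 subjects in one half of the group (half of 246) judged correctly. The count of 74 is invented for illustration.

```python
from math import comb

def binomial_upper_tail(k, n, p=0.5):
    """Probability of k or more successes in n independent trials."""
    return sum(comb(n, i) * p**i * (1 - p) ** (n - i) for i in range(k, n + 1))

# Hypothetical counts: 74 correct judgments out of 123 subjects.
n_subjects, n_correct = 123, 74
p_value = binomial_upper_tail(n_correct, n_subjects)

print(f"{n_correct / n_subjects:.0%} correct, one-tailed p = {p_value:.3f}")
```

A result of this kind shows how a proportion of correct judgments only moderately above one-half can still be statistically significant when the number of subjects is large, even though many individual "errors" occur.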

The other study chosen for illustration was a field investigation of beer 
consumption by a representative sample of families in one city (32). The con- 
sumer panel consisted of 20 families that were regular beer consumers, were 
willing to cooperate in the study, and constituted a stratified sample of the 
general beer-drinking population of that city with regard to social level. In 
contrast to most laboratory and field studies, which call for momentary ex-
pressions of preference, this study allowed a seven-day period for the develop-
ment of preferences. Each day, 48 identical-looking bottles of free beer were
delivered to each family. Included were 8 bottles of each of six brands, 
identified by the colors of the bottle caps. On any one day, subjects could 
thus continue to use the first brand tried or could keep trying new brands 
until they found one they liked. From one day to another, however, the color 
code was changed. Thus the subjects had to make a new set of choices each 
day. The subjects recorded the brands they drank each day (by color of caps), 
as well as their preferences. Empty bottles, bottle caps, and unused bottles 
were collected daily. 
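
The daily change of color code can be sketched as a fresh random assignment of cap colors to brands each day. The brand letters, the color names, and the rule used here (reshuffle until the assignment differs from the previous day's) are assumptions for illustration; the report does not state how the codes were actually drawn up.

```python
import random

BRANDS = ["A", "B", "C", "D", "E", "F"]  # six brands, names withheld
CAP_COLORS = ["red", "blue", "green", "gold", "white", "black"]  # assumed colors

def weekly_color_codes(days=7, seed=0):
    """Return one brand-to-color mapping per day. Each day's mapping as a
    whole differs from the previous day's, although a single brand may by
    chance keep its color."""
    rng = random.Random(seed)
    codes = []
    for _ in range(days):
        colors = CAP_COLORS[:]
        rng.shuffle(colors)
        # Reshuffle if the mapping happens to equal the previous day's.
        while codes and colors == [codes[-1][b] for b in BRANDS]:
            rng.shuffle(colors)
        codes.append(dict(zip(BRANDS, colors)))
    return codes

def decode(day_code, cap_color):
    """Translate a recorded cap color back into the brand for that day."""
    return next(brand for brand, color in day_code.items() if color == cap_color)

codes = weekly_color_codes()
for day, code in enumerate(codes, start=1):
    print(day, code)
```

Because the subjects record only cap colors, the investigator needs the day's code to translate each record back into a brand, and a preference that reappears under changing colors cannot be attributed to the color itself.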

Significant brand differences were found both in number of bottles con-
sumed and in recorded preferences. Expressed brand preferences agreed with
data on consumption. Apparently beer drinkers can differentiate among certain
brands even in the absence of brand names.

Product Development Studies. A major use of product testing is to aid
in the development of new products or the improvement of existing products. 
The experimental techniques employed for this purpose are quite varied, as 
are the problems investigated. A few examples will illustrate this diversity. 
When rayon was first put on the market, it encountered a certain amount of 
consumer resistance because of its unfamiliar “feel.” Accordingly a study was 
conducted on the tactile sensations reported by subjects when feeling different, 
experimentally produced weaves of rayon fiber, as well as the pleasantness or 
unpleasantness of the sensations (87). Other studies have dealt with such 
questions as the perceived crispness of cereals stored in different kinds of 
packages, consumer preferences for oval or rectangular sponges, and the best 
color combination for a cake box (25). One experiment demonstrated that 
bread is perceived to be fresher when wrapped in cellophane than when 
wrapped in wax paper (17). The subjects judged the “freshness” of equally 
fresh loaves by the familiar practice of squeezing the loaf. The resulting tactual 
sensations were evidently influenced by the nature of the wrapper. 




In designing new telephone equipment, the Bell Laboratories regularly con- 
duct laboratory tests with small local samples, followed by field tests with larger 
and more representative consumer samples. An example is provided by a 
series of four laboratory studies conducted with all-numeral dialing (AND), 
in which numbers replace the letters of the traditional letter-numeral dialing 
(LND) system (49). Among the advantages of AND are that it requires less 
dial space and avoids the common confusions between the letter O and the 
number zero and between the letter I and the number one. It remains to be 
seen, however, how AND and LND compare in accuracy and convenience in 
actual use. Experiments were accordingly designed in which subjects were 
required to look up, dial, or recall the two kinds of telephone numbers under 
controlled but realistic conditions. Some experiments were conducted in a 
special test room, some in the subjects’ own offices, and some in their homes. 
Each experiment extended over several weeks. 

The two systems were compared with reference to speed and accuracy of 
performance as well as expressed preferences. Although it required slightly 
longer to memorize AND telephone numbers, the results generally favored 
the new system and did not reveal any serious difficulties in the changeover 
process. Field trials were initiated following the laboratory tests. The final
decision will take into account the results of many such studies. 

Mention should also be made of the use of anthropometric data in designing 
such consumer products as clothing; seats for railway cars, airplanes, auto- 
mobiles, and auditoriums; household furniture; and telephone booths, to name 
only a few examples (see 65, ch. 13; 66, ch. 11). Although the collection and 
analysis of anthropometric measures are not strictly speaking product testing, 
they are consumer research in a broad sense. In the design of certain products, 
anthropometric measures are utilized together with behavioral data and con- 
sumer preferences. 

Unlike opinion surveys and other verbal techniques, product testing can be 
applied to any kind of subject. An example of product testing with a rather 
special kind of consumer population is shown in Figure 76.

Food Acceptance Studies. The Quartermaster Food and Container Insti- 
tute for the Armed Forces conducts extensive laboratory and field studies on 
food preferences (see 34, 82, 83). Although the immediate object of this re-
search is to improve meal planning in the armed services, it has made signifi-
cant methodological contributions to consumer psychology as a whole. Many of
the techniques developed at the Quartermaster Institute are applicable to a
variety of problems pertaining to the manufacturing and distribution of con-
sumer goods. And some of the findings throw light on basic characteristics of
the rating process.

Much of the research of the Quartermaster Institute is carried out in a 
specially built food acceptance laboratory, with individual booths for each 
subject. One investigation (82) was designed to see how closely results ob- 
tained with a civilian sample in this laboratory approximated those obtained 
with a soldier-consumer sample tested at an army camp by the same procedure. 
In both groups, each subject was served samples of 3 foods in a single test 
session. Four sessions were employed to test a total of 12 foods; the entire
series was replicated in different combinations to provide estimates of relia- 
bility. The subject recorded his reaction to each food on a nine-interval graphic 
rating scale ranging from “like extremely” to “dislike extremely.” This rating
scale, known as the "hedonic scale," is used regularly in the Quartermaster
Institute studies.

Fig. 76 Some consumers are not human. A boxer participates in a
dog-food taste test. The different varieties of the product presented in the
three pans are weighed before they are given to the animal and again
after removal. (Courtesy of The Quaker Oats Company.)

Both laboratory and field studies yielded satisfactory reliabilities, the co-
efficients ranging from .80 to .96. The correlations between laboratory and
field ratings assigned to the different foods proved to be equally high. It could 
thus be concluded that pretesting of rations on civilian samples in the Quarter- 
master Institute laboratory represents a satisfactorily close approximation of 
pretesting with soldier-consumer panels. Further research is in progress on 
the correspondence between such laboratory-type tests and observations under 
more natural conditions, including regular mess-hall feeding and the use of 
rations in field maneuvers. 
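
The correspondence between laboratory and field results is expressed as a product-moment correlation across foods. A minimal sketch of the computation follows; the two lists of mean ratings for 12 foods are hypothetical and are not the Quartermaster Institute data.

```python
import math

def pearson_r(x, y):
    """Product-moment correlation between two equally long lists."""
    n = len(x)
    mean_x, mean_y = sum(x) / n, sum(y) / n
    sxy = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    sxx = sum((a - mean_x) ** 2 for a in x)
    syy = sum((b - mean_y) ** 2 for b in y)
    return sxy / math.sqrt(sxx * syy)

# Hypothetical mean hedonic ratings (scale of 1 to 9) of 12 foods.
laboratory = [7.4, 6.1, 5.2, 7.9, 4.8, 6.6, 5.9, 7.1, 3.9, 6.3, 5.5, 6.9]
field = [7.1, 6.4, 5.0, 7.6, 5.1, 6.2, 6.0, 7.3, 4.2, 6.0, 5.8, 6.6]

print(round(pearson_r(laboratory, field), 2))
```

A high coefficient indicates that foods ranked high in one setting are also ranked high in the other; it does not by itself show that the absolute level of the ratings is the same in both.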

Another investigation was concerned with an analysis of preferences for 
lists of named foods (91). This technique, which is a type of consumer opinion 
survey, had generally yielded higher mean preference ratings for each food 
than those obtained with laboratory taste tests. One reason for this discrepancy 
may be found in the frame of reference used by respondents in rating named 
foods. The individual's actual liking for any given food, such as beef stew, 
will vary from one occasion to another partly as a function of variability in 
ingredients, manner of preparation, and other characteristics of the particular
serving. In a laboratory taste test, the individual responds to the food as pre-
sented. In a purely verbal opinion survey, however, he is asked to judge the
generalized food rather than a particular version of it. Are these generalized
judgments based upon the respondent's average experiences with this food, 
upon the best examples he has encountered, or upon the poorest? It was to 
answer this question that the present investigation was undertaken. 

Three groups of naval enlisted personnel were asked to rate, on the nine- 
interval hedonic scale, 54 individual foods falling into the eight classes listed 
in Table 14. The first group was given the standard normal instructions, which 


TABLE 14 Mean Preference Ratings of Foods 
as a Function of Instructions 
(From Schutz and Kamenetzky, 91, p. 176) 
llli ci sadM dE M m 


Instructions 
doe Do wor ee 


No. of 

Foods in “Best “Poorest 

Food Group Group Normal Servings” Servings” 
Main dishes 12 6.78 6.76 6.05 
Vegetables 10 5.50 5.39 4.90 
Desserts 9 7.31 7.42 6.71 
Potatoes and starches 6 6.94 6.97 6.11 
Soups 5 5.85 5.74 5.43 
Breads 5 7.40 7.43 6.84 
Breakfast foods 4 6.75 6.78 6.03 
Beverages 3 7.24 7.13 6.80 
All foods 54 6.58 6.57 5.93 


Au toods ^ "T 


simply call for a rating of how much the respondent likes or dislikes the item. 
Subjects in the second group were asked to rate the "Best Serving of each 
food they had ever eaten, and those in the third group were asked to rate 
the “Poorest Serving.” For foods they had never eaten, all respondents were 
told to mark "Not Tried." As can be seen in Table 14, the mean ratings ob- 
tained in the three groups clearly demonstrate that, under normal survey con- 
ditions, subjects tend to rate the best example of each food they have en- 
countered. For each class of foods, as well as for all classes combined, mean 
ratings with normal instructions are practically identical with those obtained 
for "Best Servings." The ratings for "Poorest Servings," on the other hand, are
appreciably lower.
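
The comparison described above can be checked directly from the group means in Table 14. The following is only an illustrative sketch: the function name is hypothetical, the averages are unweighted across the eight food classes, and no significance test is implied, since the source reports none.

```python
# Group means transcribed from Table 14:
# (normal instructions, "Best Servings", "Poorest Servings").
TABLE_14 = {
    "Main dishes": (6.78, 6.76, 6.05),
    "Vegetables": (5.50, 5.39, 4.90),
    "Desserts": (7.31, 7.42, 6.71),
    "Potatoes and starches": (6.94, 6.97, 6.11),
    "Soups": (5.85, 5.74, 5.43),
    "Breads": (7.40, 7.43, 6.84),
    "Breakfast foods": (6.75, 6.78, 6.03),
    "Beverages": (7.24, 7.13, 6.80),
}


def mean_abs_gap(rows, i, j):
    """Average absolute difference between columns i and j across food classes."""
    gaps = [abs(row[i] - row[j]) for row in rows]
    return sum(gaps) / len(gaps)


rows = list(TABLE_14.values())
gap_normal_best = mean_abs_gap(rows, 0, 1)      # normal vs. "Best Servings"
gap_normal_poorest = mean_abs_gap(rows, 0, 2)   # normal vs. "Poorest Servings"

print(f"normal vs. best:    {gap_normal_best:.2f} scale points")
print(f"normal vs. poorest: {gap_normal_poorest:.2f} scale points")
```

On the nine-interval scale, the normal and "Best Servings" columns differ by well under a tenth of a point on the average, whereas the normal and "Poorest Servings" columns differ by more than half a point.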


ECONOMIC PSYCHOLOGY 


A branch of consumer psychology that has been taking shape since World 
War II is economic psychology. Cutting across the fields of economics and 
psychology, this area of research employs some of the data-gathering procedures 
developed in consumer opinion surveys and in the previously discussed food 
acceptance methodology. In its analysis of consumer data, economic psychology 
combines concepts taken from psychology, mathematics, and economics. Two 
types of studies will be described to illustrate developments in this area.

Operations Research. An important problem in economic psychology pertains
to the way in which the results of consumer surveys can be used in
reaching decisions about the manufacturing and distribution of a product. In 
solving this problem, psychologists have applied techniques drawn from the 
broader field of operations research. Several attempts have been made to 
develop mathematical solutions for utilizing consumer research data in product 
planning and marketing. The manufacturer wants to know, for example, the 
optimum quality level required to maximize consumer preference. Beyond a 
certain point further increments in quality would be wasted. Consumers are 
not interested in a desk that can withstand a weight of 1 ton, a watch that is 
accurate to 1/1,000 second, or a pair of shoes that will last 50 years. They 
would be unwilling to pay for the additional cost required to achieve these 
quality levels. The question then is to determine the last, or “marginal,” quality 
increment that does affect consumer preference. 

To establish optimum quality levels empirically would be impracticable, 
since it would require too many forms of each product. The usual consumer 
testing of a few forms can show that some forms are preferred over others, but 
it cannot establish a true optimum with regard to any product characteristic. 
Benson (7-10) has proposed a systematic procedure based on the marginal 
preference model. Essentially, this procedure requires the measurement of 
consumer preferences for a limited number of product forms. From these data, 
a mathematical function relating degree of preference to qualitative product 
variables is established. The point of maximum preference can then be de- 
termined mathematically. 
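
The general logic of such a procedure can be sketched in a few lines. The example below is not Benson's actual model, whose functional form is not given here. It assumes, purely for illustration, that preference is a quadratic function of a single quality variable, and it uses invented ratings for five hypothetical product forms. The function names are likewise hypothetical.

```python
def det3(m):
    """Determinant of a 3x3 matrix given as nested lists."""
    return (m[0][0] * (m[1][1] * m[2][2] - m[1][2] * m[2][1])
            - m[0][1] * (m[1][0] * m[2][2] - m[1][2] * m[2][0])
            + m[0][2] * (m[1][0] * m[2][1] - m[1][1] * m[2][0]))


def fit_quadratic(xs, ys):
    """Least-squares fit of y = a*x**2 + b*x + c via the normal equations."""
    n = len(xs)
    s1 = sum(xs)
    s2 = sum(x ** 2 for x in xs)
    s3 = sum(x ** 3 for x in xs)
    s4 = sum(x ** 4 for x in xs)
    t0 = sum(ys)
    t1 = sum(x * y for x, y in zip(xs, ys))
    t2 = sum(x * x * y for x, y in zip(xs, ys))
    matrix = [[s4, s3, s2], [s3, s2, s1], [s2, s1, n]]
    rhs = [t2, t1, t0]
    d = det3(matrix)
    coeffs = []
    for col in range(3):
        # Cramer's rule: replace one column with the right-hand side.
        replaced = [row[:] for row in matrix]
        for r in range(3):
            replaced[r][col] = rhs[r]
        coeffs.append(det3(replaced) / d)
    return tuple(coeffs)  # (a, b, c)


# Invented data: five forms of a product at increasing quality levels,
# with the mean preference rating obtained for each form.
quality_levels = [1, 2, 3, 4, 5]
mean_preference = [3.9, 5.9, 6.9, 6.9, 5.9]

a, b, c = fit_quadratic(quality_levels, mean_preference)
if a >= 0:
    raise ValueError("fitted curve has no interior maximum")
optimum_quality = -b / (2 * a)  # vertex of the fitted parabola
print(f"estimated optimum quality level: {optimum_quality:.2f}")
```

With these invented ratings the fitted maximum falls between the third and fourth forms, a quality level that was never itself tested. That is the sense in which a limited number of product forms can yield an estimate of the optimum.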

Another question is concerned with the particular combination of qualities 
that will maximize consumer satisfaction (7, 8). For example, given a fixed 
amount of money for manufacturing a toothpaste to sell at a particular price, 
how should this money be apportioned to the various product qualities, such 
as taste, cleansing efficiency, and foaminess? Still another question pertains 
to the allocation of funds to packaging, advertising, and selling. Out of a 10- 
cent cost per item, what proportion should be spent on manufacturing and 
what proportion on marketing? This type of question can be answered by 
measuring the increases in product preference that result when different
amounts are spent on each. From these data, the appropriate mathematical
functions can be derived (8).

A study by Jones (47) illustrates the use of a mathematical model in pre- 
dicting consumer purchases from preference data. The items investigated were 
15 entrées served over a five-day period at a university faculty club. The list 
of items, together with a seven-category rating scale, was mailed to 430 club 
members, of whom 297 returned completed forms. The rated preferences, as 
well as data on the relative popularity of different price levels, were used in 
mathematically derived equations to predict the percentage of persons who 
would actually purchase each of the three entrées appearing on the menu on 
any given day. These predictions proved to be quite accurate. The mean dif- 
ference between predicted and actual percentages was only 3 percentage 
points.

With regard to the cost data, the maximum preference was for the
intermediately priced rather than for the cheapest items. Consumer reaction to
prices is itself an important problem in economic psychology. It is likely that
for many types of items price is perceived as a cue to quality. In the particular
situation investigated in this study, the prestige factor may also have affected
choices (see 47). Faculty members lunching with colleagues may be reluctant 
to order the cheapest item on the menu—at least in some universities! 

A brief overview of some of the ways in which operations research is being 
utilized in advertising is provided by a report of an operations research dis- 
cussion group, prepared by the Advertising Research Foundation (78). One 
of the chief difficulties in the application of operations research techniques 
to advertising decisions is the lack of adequate data on some of the factors to 
be considered. When pertinent data are available, they often represent crude 
approximations unsuitable for complex mathematical analyses. Certain mar- 
keting questions of a relatively limited nature can, however, be satisfactorily 
answered by these techniques. 

Economic Attitudes and Buying Behavior. Another aspect of economic 
psychology is illustrated by the research of Katona and his associates at the 
Survey Research Center of the University of Michigan (51-53). This approach
utilizes interviews with representative samples of consumers and businessmen
to investigate motives, attitudes, and expectations regarding broad
economic questions. For instance, consumers are asked "Do you think that a 
year from now you people will be better off financially, or worse off, or just 
about the same?” and “Do you think that in the country as a whole during 
the next twelve months we will have good times financially, or bad times, or
what?" (52, p. 16).


Fig. 77 Sales of durable goods in relation to index of consumer attitudes
and disposable personal income. (Adapted from Katona, 53, p. 52.)

From the replies to a series of such questions, Katona computes an “index
of consumer attitudes,” which represents the over-all degree of optimism or 
pessimism expressed by the respondent. This index can be compared with 
various indices of economic behavior. Figure 77, for example, shows changes 
in consumer attitudes between 1952 and 1958 in relation to changes in sales 
of durable goods, such as automobiles and household appliances. It can be seen 
that major rises or declines in consumer “optimism” are followed by rises or 
declines in purchases, even though income continues to rise throughout this 
period. These findings support the view that demand is a function of both 
ability to buy and willingness to buy—the latter stemming from feelings of 
security or insecurity, confidence or uncertainty, optimism or pessimism, and 
other measurable attitudes. 

Katona and his associates have used interview surveys to investigate a variety 
of economic problems. Repeated interviews of the same individuals permit a 
study of changes in attitudes, the factors leading to such changes, and the 
effect of the changes on individual buying behavior (52). Table 15 illustrates 


TABLE 15 Purchase Rates of Families
with Different Attitudinal Classifications
(From Katona, 53, p. 85)

                                                  Index value of number of
                                                  durable goods purchases^a by
Grouping of families
by initial or final attitudes                     Optimists   Medium      Pessimists
Initial attitudes, purchases in first 6 months      1.15        0.95        0.88
Initial attitudes, purchases in 12 months           1.07        1.02        0.86
Final attitudes, purchases in 12 months             1.06        0.99        0.78

Grouping of families                              Improved    Unchanged   Deteriorated
by change in attitudes                            attitudes   attitudes   attitudes
Purchases in first 6 months                         1.04        0.99        0.95
Purchases in last 6 months                          1.11        0.95        0.87

^a Income held constant.


one such analysis, in which initial attitudes and later changes in attitudes are 
both shown to be related to subsequent purchases of durable goods. For this 
purpose, respondents were classified into three groups, "Optimists," “Medium,” 
and “Pessimists.” Changes in attitudes were determined by interviewing the 
same respondents on two occasions over a one-year interval. Purchase of any 
durable goods within the year was ascertained during the second interview.
These purchases have been recorded in Table 15 in terms of an index that 
adjusts for income differences among the different attitude groups. It can be 
seen that both initial and final attitudes are related to frequency of purchase 
during the year. It is also noteworthy that improvement in attitude is associated 
with higher frequency of purchase during the latter half of the period but 
shows a negligible relation to purchases during the first half. This is to be 
expected, since attitude changes were measured at the end of the period. 
Among the other problems investigated by these procedures may be mentioned
the relation of money saving to such factors as income, demographic
variables, and attitudes. Several studies have been concerned with consumer 
motives in buying, for example, life insurance, one-family houses, and house- 
hold appliances. Particular attention in this connection was given to the ques- 
tion of "saturation." What happens to purchases after consumer needs have 
been satisfied through the extensive availability of consumer goods? The 
results support the view that, when existing needs are gratified, other needs 
take their place. When a family buys their longed-for home, they need new 
appliances and furnishings. When they have acquired all the appliances they 
can use, they want new models with improvements, and so on. 


SUMMARY 


Beginning with the psychology of advertising and selling, consumer psychology 
has gradually broadened to include research on consumer wants, preferences, 
attitudes, and buying behavior. The techniques of consumer psychology are 
also applicable to problems outside of business and industry, as illustrated by 
public opinion polls, public health campaigns, and meal planning in the armed
services.

Procedures for testing the effectiveness of advertisements include the con- 
sumer jury technique, readership surveys, controlled exposure studies, brand 
use surveys, analysis of coupon returns, and sales tests. Radio and television 
research has been concerned with audience size and composition, audience 
reaction to specific programs or types of programs, and sales effectiveness of 
advertising. Among the techniques employed in such research are the hidden 
offer, telephone inquiries (including coincidental calls), mailed questionnaire,
personal interviews, mechanical recorders such as the Nielsen Audimeter, and 
the CBS Program Analyzer. Some use has also been made of depth inter- 
viewing and projective techniques to investigate the emotional reactions of 
audiences.

Salesmanship may be regarded as the most flexible form of advertising. Be- 
cause of the many varieties of selling, generalizations about effective sales- 
manship are likely to be of little value. Psychologists have concentrated on 
procedures for the selection and training of salesmen for specific jobs. Available
research on the sales process, though meager, suggests the importance of a 
customer-oriented point of view, two-way communication, and service. 

In any consumer opinion survey, the basic methodological problems concern 
sampling procedures and questionnaire construction. Sampling may be random 
or stratified, the latter requiring fewer cases. In area sampling, households in 
small geographical units are sampled, rather than persons. Sample size should 
be evaluated in terms of the resulting standard errors of percentages. Survey 
results must be checked for nonresponse bias and volunteer error. The ques- 
tions employed in a consumer survey need to be formulated with care and 
pretested on a representative sample. In telephone and face-to-face surveys,
interviewer variables must be taken into account.
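
The relation between sample size and the standard error of a percentage can be illustrated with the familiar formula for a simple random sample, SE = sqrt(p(100 - p)/n), where p is the obtained percentage and n the number of cases. The sketch below assumes simple random sampling. The function name and the sample sizes are chosen only for illustration, and stratified or area samples would call for different formulas.

```python
import math


def standard_error_of_percentage(p, n):
    """Standard error, in percentage points, of a percentage p based on n cases.

    Assumes a simple random sample.
    """
    return math.sqrt(p * (100.0 - p) / n)


sample_sizes = [100, 400, 1600]
errors = [standard_error_of_percentage(50.0, n) for n in sample_sizes]

for n, se in zip(sample_sizes, errors):
    print(f"n = {n:5d}: standard error = {se:.2f} percentage points")
```

Each fourfold increase in the number of cases only halves the standard error, which is why sample size is best judged by the precision required rather than by the sheer number of respondents.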

Product testing may take many forms, as illustrated by brand identification 
and brand preference studies, product development studies, and food
acceptance research. Economic psychology is a recent development within
consumer psychology, which combines procedures from psychology, economics, and
mathematics. Two rather different applications of it are to be found in
operations research and in the study of economic attitudes in relation to
buying behavior.


Emotional and Motivational
Factors in Buying Behavior


Why do consumers use certain commodities or services? What do they like or 
dislike about a product? Why do they buy one brand rather than another? 
How can advertising arouse a desire for a product and lead to actual purchase? 
Although modern techniques of consumer research have focused attention 
upon questions such as these, the importance of emotional and motivational 
factors was recognized in the earliest writing on advertising psychology. 


APPEALS


The appeal is the message of an advertisement. It links the product or service 
advertised with consumer wants and needs. Whether the appeal is presented 
through written copy, illustration, spoken word, or television commercial, its 
object is to show how the commodity may satisfy consumer wants. Early 
textbooks on the psychology of advertising presented lists of "instincts," basic 
organic needs, or primary drives from which the advertiser might choose. 
Among these drives were hunger, thirst, sex, and the avoidance of pain or 
danger. These were followed by longer lists of secondary, or learned, drives, 
such as the needs for status or prestige, social approval, achievement, and 
cleanliness. 

By looking through any current magazine or newspaper, one can easily find 
examples of advertising appeals based on each of these drives. One can also 
find evidence of the operation of one or more of these drives in most human 
activities. But merely labeling an activity as a manifestation of the hunger drive 
or the achievement drive tells us little about it. Nor are traditional lists of 
drives of much help to the ad writer. For one thing, the lists drawn up by 
different authors vary widely, especially with regard to the secondary drives. 
Classifications of drives are largely arbitrary and subjective. Psychologists are 
still a long way from a satisfactory understanding of motivation; no general 
agreement has been reached about the nature or operation of motives (see 34,
pp. 253-272). Since basic experimental data in this field are still meager,
consumer psychologists must rely upon their own applied research designed to
answer specific questions. 

The traditional concepts of drives, whether primary or secondary, are also 
too general to serve in the choice of advertising appeals. The hunger drive 
could be employed to advertise any food, from soup to candy. Yet there are 
vast differences between the appeal of a steaming bowl of soup on a cold day 
and that of a gift box of candy on one's birthday. Safety or the avoidance of 
danger is undoubtedly an important consideration in buying a car, but the
actual choice between makes is often based on the automobile's role as a
status symbol. Furthermore, buying decisions are often based on multiple and 
interlocking motives. In choosing a particular cake mix, the housewife may be 
influenced by her own craving for sweets, by pride in her achievement as a 
cook, by her need for social approval from family and guests, and by a desire 
to prove she can bake a better cake than her mother-in-law. 

It might seem that a knowledge of the relative strength of different drives 
should help advertisers in their choice of appeal. But the strength of any 
drive is not a fixed quantity. Strength of primary drives depends upon degree of 
deprivation. When thwarted, some primary drives become overpowering. If 
breathing is hampered, the individual will fight violently for air. To a starving 
man, food becomes extremely important. But the advertiser is not addressing his 
message to starving consumers. When these primary drives are satisfied, other 
drives become prepotent. 

The relative strength of secondary drives depends upon learning. Hence 
their strength will vary from one culture or subculture to another. Anthro- 
pology and differential psychology provide many examples of such group 
differences in the relative strength of drives (see 1, ch. 18). New drives 
develop as the individual’s experiences change. For every drive, moreover, 
learning determines in large part what the individual does to satisfy it and 
what incentives he seeks. Cultural differences in eating customs and in food 
preferences provide a dramatic illustration of this fact. 

Finally, in his choice of appeals the advertiser should take into account the 
specificity of buying motives. It is not so much a question of whether economy 
or prestige represents the stronger drive for a consumer. Rather it is a question 
of which is uppermost in his choice of a specific commodity. For the same 
consumer, economy may be foremost in the purchase of a washing machine, 
while prestige predominates in the choice of a car. The absurdity of trying 
to use the same “tried and true” appeal for very dissimilar products inspired 
the cartoon reproduced in Figure 78. The distinguished-looking man with the
eye patch, featured repeatedly in Hathaway shirt ads (Figure 91), may sell
shirts; but an eye patch on a dog would probably have little appeal for
purchasers of dog food.

In selecting an effective appeal for a specific commodity, the advertiser can 
turn to the findings of opinion surveys and other types of consumer research. 
These techniques provide information on what consumers like or dislike about 
a product; what specific features led them to change brands; under what cir- 
cumstances they made their most recent purchase of some durable commodity; 
how they use a particular product; and many other concrete questions about 
the role a particular product plays in the consumer’s daily life. Analysis of 
subgroup results with regard to sex, age, socioeconomic level, or other variables 
may be helpful in the choice of suitable appeals for different consumer publics. 

Many examples could be cited to illustrate the way in which consumer 
studies have affected the advertising of particular products. A manufacturer 
of a household product had been featuring the fact that his product "works
faster." A consumer survey revealed that 58 per cent of the housewives who
used the product said they did so because “it's easy on the hands,” while only
2 per cent mentioned that "it's quicker" (20, p. 337). The most promising 
appeal had obviously been ignored in the previous advertising of this product. 

A consumer study conducted for M & M candy found that for many persons

Fig. 78 Appeals are specific to the commodity. (From The New Yorker,
Dec. 30, 1961, p. 32. Drawing by O'Brian; © 1961 The New Yorker 
Magazine, Inc.) 


candy eating is "associated with accomplishment of a job that the person who 
ate the candy considered disagreeable. The candy was a sort of reward or 
compensation for doing a tough job." Accordingly, M & M switched its ad- 
vertising theme in two test markets from "Smooth, rich, creamy coated choco- 
late—everybody likes 'em" to "Make that tough job easier—you deserve 
M & M candy" (58, p. 87). Following this change in advertising appeals, sales 
in the two test areas rose from 17 per cent to 35 per cent of the national 
M & M total.

A third example is provided by advertisements for men's shoes, which tradi- 
tionally stressed style, price, and details about the construction of the shoe. 
A survey of 5,000 men conducted for the Bostonian shoe company found that, 
when asked what they liked about the shoes they were wearing, 42 per cent 
of the men referred to “fit and feel,” 32 per cent to “wear and tear,” 16 per
cent to "style and looks," and 9 per cent to "price and value" (16). On the
basis of these results, the advertising appeal was changed, the shoes now being 
described in such terms as “walk-fitted.” Again the change was followed by 
an improvement in sales. 

Considerable attention has been given to the relative merits of positive and 
negative appeals in advertising. Essentially, a positive appeal portrays a pleas- 
ant situation that will follow the use of the product, while a negative appeal 
portrays an unpleasant situation that may occur if the product is not used. 
Negative appeals should not be confused with negative suggestion. The 
latter involves telling customers what not to do or calling attention to possible 
faults of competing products. Except under very special circumstances, nega- 
tive suggestion should be avoided, since it may have the opposite effect from 
that intended. A familiar example of negative suggestion is that of telling a 
child not to play with matches. Such an admonition may suggest to him that 
matches are a potential plaything to be explored on the first opportunity, an 
idea that might never have occurred to him otherwise. Similarly, if women are 
told that the new, improved “Magicreme Supreme" will not enlarge pores, 
they may become alarmed about the possible ill effects of using any facial 
cream. Hence they may stop using facial creams altogether, instead of rushing 
to their nearest drug or department store to buy a jar of Magicreme. 

Negative appeals, on the other hand, may be quite effective. For certain 
commodities, negative appeals can be made more dramatic and arresting than 
the corresponding positive appeals. An example of the continued and successful 
use of a negative appeal is shown in Figure 79. In all its advertising, this 
company has focused on the appeal that the Sanforized label on a garment 
is an assurance that the cloth will not shrink. Other examples of negative 
appeals can be found in the ads for products designed to protect against 
“halitosis,” “B.O.,” “coffee nerves,” “tired blood,” and the many other familiar 
advertiser diseases. 

That negative appeals can be quite successful has been repeatedly demon- 
strated by survey data. In one extensive analysis of readership data, ads show- 
ing the unfavorable results of not using the product received much more atten- 
tion than did ads showing the favorable results of using the product (47, p. 
69). The former were also read by a larger proportion of persons than were the 
latter. Other studies have demonstrated the effectiveness of negative appeals for 
some commodities in terms of coupon returns and direct-mail sales (30, 31). 
On the other hand, there are many types of products for which negative 
appeals are inappropriate. The age and other characteristics of the consumers 
may also influence the relative effectiveness of positive and negative appeals. 
The results may likewise be affected by the specific way in which the negative 
appeal is presented. Thus, if the situation portrayed is too unpleasant, as in a 
detailed picture of a serious accident, people will simply turn away from the 
ad. Moreover, the emphasis should be placed on the action the individual may 
take to avoid the unpleasant situation. It is escape from fear that reinforces 
learning, rather than fear itself (34, p. 268).

Although the large majority of advertisements contain appeals, there are 
two well-known exceptions. One is the reminder ad, whose object is simply 
to keep the brand name before the public. Such ads serve to reinforce the 
associations with brand names that were established by other, conventional 
ads. They are effective with brands that are already familiar and widely
advertised. Unless supported by other ads that do contain appeals, reminder ads
would probably have little effect on sales.


Why Betty bewares of some wash-and-wears
Another Ballad by SANFORIZED-PLUS

1. Fashion-conscious Betty Murray
Shopped at lunch-time in a hurry.
Bound for a week end, free from care,
Searched for a skirt tagged "wash-and-wear,"

2. She slipped one on, it fit just fine
And Betty rejoiced—"This one is mine!"
Its pleats had a certain flirty flair
And the tag maintained "just wash and wear."

3. But, along came time to wash the garment
And Betts berated the unknown varmint
Who had put that label on her skirt!
Betty was angry and deeply hurt.

4. In place of pleats—a muss of wrinkles
And shrunk to boot! (Betts called it "shrinkles.")
The moral, friend, is save your trust
For items labeled "Sanforized-Plus,"

5. This reassuring label or tag
Will never make you weep or gag.
Attached to wash-and-wearable duds
Means style that lasts through tubs of suds,

SANFORIZED, for protection against shrinkage
SANFORIZED-PLUS, for excellence in wash-and-wear performance
... will not shrink more than 1% by the Government's standard test.

Fig. 79 A negative appeal used repeatedly and successfully by the same
company. In the issue of Life in which it appeared, this ad ranked first
on the basis of “read most” women readers per advertising dollar.
(Courtesy of Cluett, Peabody, & Co.)

Another type of ad that contains no appeal is the institutional ad. Although 
showing the name of the company, such ads make no mention of the merits 
of any products. Their object is to advertise the company as a whole, rather 
than any specific product. The ad may publicize some worthy cause, or it 
may feature the company's own beneficent activities. Insofar as the name of 
the company is included, such ads serve a reminder function. Their principal 
effects on product sales, however, presumably stem from the association of 
desirable feeling tones with the company and its products. If the company is 
perceived as public-spirited, farsighted, and generally on the side of Truth- 
Goodness-and-Beauty, people are likely to feel favorably disposed toward its 
products. The implications of this type of advertising will be seen more clearly 
in the light of the discussion of feeling tone in the next section, as well as 
the discussion of corporate image in the last section of this chapter. 


FEELING TONE 


Nature of Feeling Tone. While the appeal in a printed ad or sales talk 
tells the consumer explicitly why he should purchase a commodity, other aspects 
of advertising may influence his reaction to the product in more subtle ways. 
Such effects have been variously described as “feeling tone,” “atmosphere,” 
“mood,” and “indirect suggestion.” Their operation can be illustrated by an 
early experiment (28). Interviewers made a house-to-house survey of hosiery 
preferences in one city. Four identical samples of hosiery were employed, 
three of which were packed in boxes containing a card of perfumer's blotting 
paper with faint perfumes. The four boxes were displayed in different parts 
of the room so that odors would not conflict. Each housewife was asked to 
examine the hosiery and indicate which was the best quality. Respondents 
apparently considered such characteristics as texture, weave, wearing quality, 
weight, and sheen, although all four pairs of stockings were identical in these 
respects and in every other way. 

Tabulation of the expressed preferences showed that 50 per cent chose the
stockings with a narcissus odor as being the best quality, while only 8 per 
cent chose those with no perfume. By chance, 25 per cent should have chosen 
each pair. Only 6 of the 250 respondents commented on the scent. Thus it 
appears that the perfume could engender a pleasant feeling tone toward the 
product even when subjects did not specifically notice it. This experiment also 
illustrates the spread of feeling tone beyond its immediate source. The fact 
that the housewives liked the perfume made them judge stockings more 
favorably. 
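
How far these percentages depart from chance can be gauged with a simple normal-approximation test of each proportion against the expected 25 per cent. The sketch below is an illustration only. It assumes that the reported percentages are based on all 250 respondents, which the account implies but does not state, and the original study's own statistical treatment is not described here. The function name is hypothetical.

```python
import math


def z_against_chance(count, n, p_chance):
    """z score of an observed proportion against a chance proportion.

    Uses the normal approximation to the binomial distribution.
    """
    p_hat = count / n
    se = math.sqrt(p_chance * (1.0 - p_chance) / n)
    return (p_hat - p_chance) / se


N_RESPONDENTS = 250   # respondents in the survey
P_CHANCE = 0.25       # four identical pairs, so one in four by chance

# Counts reconstructed from the reported percentages (assumption: base of 250).
narcissus_count = round(0.50 * N_RESPONDENTS)
unscented_count = round(0.08 * N_RESPONDENTS)

z_narcissus = z_against_chance(narcissus_count, N_RESPONDENTS, P_CHANCE)
z_unscented = z_against_chance(unscented_count, N_RESPONDENTS, P_CHANCE)

print(f"narcissus scent: z = {z_narcissus:.1f}")
print(f"no perfume:      z = {z_unscented:.1f}")
```

Both values lie many standard errors from zero, one above and one below, so that neither the popularity of the narcissus-scented pair nor the neglect of the unscented pair can reasonably be attributed to chance.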

A pleasant or unpleasant feeling tone aroused by any element of an adver- 
tisement may influence the consumer’s reaction to the product. For this reason, 
the advertiser must inquire into the feeling tones associated with all aspects of 
the ad. It is relatively easy to recognize the role of illustrations in associating 
appropriate feeling tones with the product. In advertising soup, for instance,
pictures can vividly suggest the glowing health of happy children, or the 
warmth and comfort of piping-hot soup on a blustery day, or the cae 
of an elegant dinner party. Other less obvious features of the ad may also
arouse strong feelings that will spread to the product. An effective appeal in
the ad's primary message may be worthless if an undesirable or inappropriate 
feeling tone is evoked by the colors, the printing type, or even a border design 
surrounding the ad. If the appeal is directed to elegance and daintiness while 
the printing type suggests cheapness and strength, the effects will clash and 
tend to cancel out rather than reinforcing each other. 

Color. The association of colors with different feeling tones is widely 
recognized. Studies on the suitability of different colors for advertising various 
commodities were among the earliest conducted in the psychology of advertis- 
ing (12; 43, pp. 452-462). Apart from its possible attention value (see 
Chapter 12) and its realism in portraying the commodity, color may arouse 
strong affective associations. Through common experiences, most people have 
learned to associate red with fire, blood, and danger; blue with cool rivers
and lakes; orange and yellow with sunlight and comfortable warmth, and so on.

In one investigation (56), 94 college students were shown eight colors and 
were given a list of 11 “moods” described in such terms as: exciting,
stimulating; distressed, disturbed, upset; calm, peaceful, serene; powerful, strong,
masterful. The instructions were to name the one color that best represented 
the feelings described by each group of words. For each of the 11 moods, the 
choices showed a significant predominance of certain colors. For instance, for 
exciting and stimulating, 61 subjects chose red. For tender and soothing, 41 
chose blue and 24 green; none chose red. Repetition of this experiment with 
different groups indicated that certain color associations, including those cited 
above, are quite consistent (36). Others vary with the demographic
characteristics of the population, particularly its socioeconomic level. It should be
added that changing the saturation or brightness of colors will considerably 
modify their associated feeling tones. Deep red and pink or navy and sky blue 
would hardly be expected to arouse the same feelings. 

In a special experiment designed to test the common association of
temperature with color, it was found that subjects' thresholds of "uncomfortable
warmth" were unaffected by color of illumination (4). Nevertheless, when
later questioned about the “amount of heat transmitted” by the different lights 
under which they had worked, the subjects judged the green and blue lights 
to be cooler than the white, yellow, or amber lights. 

Printing Types and Other Graphic Elements. The printing type used in 
brand name, headline, or advertising copy should be legible, attractive, and 
appropriate. In a book, poor legibility of type may decrease speed and accuracy 
of reading and increase visual fatigue. But in an ad poor legibility reduces the
motivation to read further. Even more important is the fact that the frustration
experienced in trying to read such material will cause annoyance that may
spread to the product.

Psychologists have done considerable research on the legibility of type (see
7, 40, 51, 52). Among the criteria employed to measure legibility are accuracy
of perceiving letters or words in brief tachistoscopic exposures; reading speed
and comprehension for long passages; distance at which letters, words, or
sentences can be correctly read; indices of eye fatigue after continuous reading;
and number and nature of eye movements made during reading. With more
difficult reading, there is an increase in the number and duration of eye fixations
and in the number of retracing movements. Eye movements are usually
recorded with an eye camera, although increasing use is being made of
electro-oculograms and other types of electrical recording.

Through all these techniques, it has been demonstrated that printing types 
vary significantly in legibility. Simple types are generally read faster than 
Old English or other types with elaborations and curlicues. The use of a type 
that can be read quickly and with little strain is particularly desirable in the 
copy of an ad, although some minimum standards of legibility should also 
be met in headlines and brand names. The relative legibility of different types 
varies somewhat with other factors, such as size, boldness (thickness of lines), 
width of margins, and interlinear spacing (7, 40). 

The use of colored inks and colored backgrounds also affects legibility, 
chiefly by altering the degree of brightness contrast. Fastest reading and fewest 
eye fixations have been found with black print on a white background. About 
equally good are such combinations as green on white, blue on white, and 
black on yellow. At the other extreme are such combinations as red on green 
and black on purple. In one study, the latter combination slowed down reading
rate by more than 50 per cent (40, p. 120).

Another factor that influences legibility is familiarity of both type and 
arrangement. Any unfamiliar feature tends to slow down reading. It is partly 
for this reason that a passage or headline printed in lower case is read at a 
faster rate than one set entirely in capitals (40, 41). The only advantage of a 
headline set in capitals is that it can be read at a greater distance than one 
set in lower case. This advantage is pertinent to the designing of billboards and
road signs. 

Familiarity of layout also facilitates reading. Bizarre arrangements of printed 
matter, sometimes used by advertisers as a means of gaining attention, may be 
self-defeating because of the frustration and annoyance they engender. Even
certain layouts that have been found to be superior in tachistoscopic experi- 
ments yielded poorer results in the reading of long passages than did the less 
efficient but familiar arrangement of words along horizontal lines (8). With 
adult readers who have established certain reading habits, any change in lay- 
out of printed matter will increase reading difficulty, even when the new layout 
is better adapted than the old to the characteristics of human vision.

Besides legibility, the advertiser should consider the emotional reactions and 
associations aroused by different printing types. Decided preferences for dif- 
ferent types have been found, and these preferences change with time (7). The 
types popular a generation ago appear quaint and old-fashioned today. That 
printing types arouse distinctive feelings has been repeatedly demonstrated. 
When subjects are asked to explain their likes or dislikes for different types, 
they volunteer such descriptive terms as “whimsical,” “genteel,” “aristocratic,”
“restful and soothing,” or “intellectual” (7, p. 40). Advertisers have long tried
to utilize these reactions by choosing types that express feeling tones appropriate
to their products and to the appeals they wish to convey. Figure 80
illustrates a variety of types used in brand names to express such diverse 
qualities as elegance, dependability, dignity, informality, sturdiness, and so on. 

In several early investigations, groups of subjects were given the names of 
abstract qualities and of commodities and were asked to rank printing types 
for their degree of appropriateness for each (10, 45, 48). The results show 
considerable agreement among individuals and give evidence of good internal
consistency. For instance, when types were independently judged for their
suitability in expressing cheapness and in expressing luxury, the two rank
orders correlated -.94. The corresponding correlation for luxury and dignity
was +.96, for cheapness and strength +.82, and for dignity and cheapness
-.84. Rank orders assigned to the types with regard to their appropriateness in
advertising various commodities also revealed considerable correlation. Thus
the rank orders for jewelry and perfume correlated +.94, while those for
jewelry and building materials correlated -.95. These correspondences are
especially noteworthy in view of the fact that in this study a different group
of subjects ranked the printing types for each quality and each commodity
(45).

Fig. 80 Printing types that arouse widely different feeling tones.
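
The rank-order correlations reported above compare two rankings of the same set of printing types. As a minimal sketch of such a computation, the following Python snippet applies Spearman's rank-difference formula, rho = 1 - 6*sum(d^2) / (n(n^2 - 1)), to invented rankings of ten types. The rankings are hypothetical and are not the data of the study cited, and the choice of Spearman's coefficient is itself an assumption, since the text does not name the coefficient used.

```python
def spearman_rho(rank_a, rank_b):
    """Spearman rank-difference correlation for two rank orders without ties."""
    if len(rank_a) != len(rank_b):
        raise ValueError("both rank orders must cover the same items")
    n = len(rank_a)
    d_squared = sum((a - b) ** 2 for a, b in zip(rank_a, rank_b))
    return 1 - 6 * d_squared / (n * (n ** 2 - 1))

# Hypothetical ranks of ten printing types (1 = most appropriate).
luxury    = [1, 2, 3, 4, 5, 6, 7, 8, 9, 10]
dignity   = [2, 1, 3, 4, 6, 5, 7, 8, 10, 9]
cheapness = [10, 9, 8, 7, 5, 6, 4, 3, 1, 2]

print("luxury vs. dignity:  ", round(spearman_rho(luxury, dignity), 2))
print("luxury vs. cheapness:", round(spearman_rho(luxury, cheapness), 2))
```

Two qualities that call for the same types yield a coefficient near +1, while qualities that call for opposite types yield one near -1, which is the pattern described in the text.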

That feeling tones may be associated with such simple stimuli as lines was 
demonstrated in another early study (44). A total of 500 subjects were given 
a printed sheet containing 18 short lines, as well as a list of adjectives describ- 
ing 13 classes of feelings, such as sad, merry, agitating, and powerful. The 
subjects were instructed to choose the one line that best expressed each of the 
13 classes of feelings. The lines were drawn so as to represent different com- 
binations of three variables: (1) direction (horizontal, upward, downward); 
(2) form (curved or angular); and (3) "rhythm," or number of waves per inch 
(slow, medium, fast). The results again showed considerable agreement among
subjects. Thus for "sad," 86.3 per cent chose a curved rather than an angular 
line, 84.3 per cent chose a downward-sloping line, and 86.8 per cent chose a 
slow rather than a medium or fast rhythm. For “powerful,” 85.1 per cent chose 
an angular line, 55.5 per cent chose an upward-sloping line, and 52.7 per 
cent chose a medium rhythm. 

Language. That distinct feeling tones can be aroused by sounds as well as
by shapes is illustrated in Figure 81. Given the two meaningless forms and
the two meaningless "words" shown in this figure, the vast majority of persons
will associate the word containing the l and m sounds with the gently curving
shape, and the word containing the t, r, and k sounds with the sharply angular
shape. The feeling tones aroused by different sounds should be considered in
devising brand names, as well as in selecting appropriate words for advertising
copy (see, e.g., 23). In the choice of a brand name, meaningful associations
should also be investigated, preferably in a consumer survey. Phonetically, one
of the most beautiful combinations of sounds in the English language is
"cellar door," but this name would hardly be suitable for a new perfume.

Fig. 81 Feeling tones associated with forms and sounds. Which of these shapes is Lamolay and which is Taratok?

Another important factor in the success of a brand name is ease of pro- 
nunciation. Just like type that is difficult to read, a brand name that is difficult 
to pronounce tends to arouse annoyance and hostility. Because of the possible 
embarrassment resulting from mispronouncing the name, moreover, many cus- 
tomers will hesitate to ask for the product and will buy a more pronounceable 
competing brand instead. For mass-market items, such brand names as Baume 
Bengué, Djer-Kiss, Pro-phy-lac-tic, Nescafé, and Cinzano put an unnecessary 
burden on the consumer.

There are a number of instances in which a manufacturer became aware of
the sales handicap resulting from a hard-to-pronounce brand name and did 
something about it. The makers of Baume Bengué, for example, began to 
insert "pronounced Ben Gay" under the name. Later, the name was changed 
to Ben Gay altogether. Another classic example is that of Suchard chocolate
(17). Recognizing the difficulties that American consumers were having in 
pronouncing this name, the company began to print the slogan "Say Soo 
Shard" on the wrapper of each chocolate bar. This in turn led to the decision 
to adopt a live trademark in the form of an attractive little girl who was to be 
known as “Sue Shard.” Eventually this little girl was featured in all advertise- 
ments for the company, through photographs, comic strips, radio commercials, 
and personal appearances. 

Apart from word choice, many other aspects of language serve to arouse 
feeling tones that may be pleasant or unpleasant, appropriate or inappropriate. 
A passage consisting of short sentences, with a choppy, jerky rhythm, for example,
is likely to suggest pep, sparkle, and liveliness; one written in longer
sentences, with strings of modifiers and a slow, languorous rhythm, is more 
likely to suggest relaxation and reverie. The imagery evoked by language also 
needs to be considered. To say that "evaporated milk makes a fine curd in the 
stomach and is therefore more digestible than regular milk" could be factually 
correct, but it arouses imagery that is unappetizing. The whole question of 
ease of understanding and “readability” is also of some relevance to feeling tone 
but can be more appropriately discussed in Chapter 12. 


MOTIVATION RESEARCH 


Finding out why people buy a certain product—or why they do anything, for 
that matter—is difficult. In answer to a direct question about the reasons for 
their choices or actions, people are likely to give rationalizations or "socially 
desirable” answers which may differ from their real motives. It was to meet 
these difficulties that the various techniques loosely grouped under “motivation 
research” were introduced into consumer psychology. The potentialities of 
motivation research fired the imagination of advertising men and of the general
public alike. Articles and books soon appeared in the popular press, extolling
its virtues (e.g., 32) or lambasting it as a dire threat to our privacy of thought
and our freedom of action (e.g., 39). Fortunately, sober and well-balanced
surveys of these techniques are also available (15, 19, 50). 

The data-gathering procedures included under motivation research are many 
and varied. Some writers use the term so broadly as to cover any application of 
the methods of psychology, sociology, or anthropology to consumer problems. 
In its more usual, restricted sense, motivation research refers to the use of in- 
tensive, disguised, qualitative, clinical procedures in the study of buying 
motives. One of these techniques is depth interviewing, which may itself vary 
widely in the hands of different investigators (11; 50, chs. 3-5). At one extreme
we find a fairly structured approach, with predetermined opening questions
followed by nondirective probes. At the other extreme is psychoanalytic
interviewing designed to explore emotional contexts, symbolic meanings, and 
fantasies that may be associated with a product; the origins of these associations 
are typically tracked down to early childhood experiences. Another variant is 
the group interview, which may be conducted either with a natural group such 
as a family or club or with a specially assembled consumer panel (50, ch. 6).
This situation gives the respondents an opportunity to interact with each other, 
as well as to react directly to questions, sample products, demonstrations, films, 
or other stimuli presented by the interviewer. 

Motivation research has also made extensive use of almost every available 
kind of projective technique (50, chs. 7-14). Among those most commonly 
employed may be mentioned both free and controlled word-association tests, 
sentence completion, and picture interpretation tests patterned after the 
Thematic Apperception Test. A number of studies have used cartoons in which 
the subject must supply the speech of one of the characters, as in the Rosen- 
zweig Picture-Frustration Study. One such cartoon, used in a cigar survey, 
portrayed a man coming home from work and announcing to his wife, "I've 
decided to take up smoking cigars, dear.” The wife's reply was to be filled in by 
the respondent (50, pp. 120-121). Such tests try to capitalize on the fact that 
a respondent is often willing to put in the mouth of a third person an opinion 
that he himself holds but is unwilling to express. The same idea can be 
utilized by merely wording an ordinary question in the third person. Instead 
of asking, “What do you think are the objectionable features of this cleanser?”
we can say, “Some women who use this cleanser find a lot of faults with it. 
I wonder if you can guess what they are objecting to” (50, p. 104). 

Another common technique is role playing, or visualization (19, pp. 74-79;
50, ch. 10). This technique was employed in a widely quoted study of the
attitudes of housewives toward instant coffee (18). Although some form of 
powdered instant coffee had been on the market for many years, it had not been 
widely adopted at the time of this study. When directly questioned about their 
reasons for not using instant coffee, most housewives said that they did not 
like the flavor. Suspecting that this might be a rationalization, the investigator 
showed other samples of housewives a shopping list, with the request that they 
try to project themselves into the situation and describe the personality of the 
woman who bought the groceries. One group of 50 housewives was given list 
A; another group of 50 was given list B. As shown below, the two lists were
identical except for the fifth item:

List A                              List B

1½ lb hamburger                     1½ lb hamburger
2 loaves Wonder bread               2 loaves Wonder bread
bunch of carrots                    bunch of carrots
1 can Rumford's Baking Powder       1 can Rumford's Baking Powder
Nescafé instant coffee              Maxwell House coffee, drip grind
2 cans Del Monte peaches            2 cans Del Monte peaches
5 lb potatoes                       5 lb potatoes


The descriptions obtained with the two lists were distinctly different. Among 
the women shown list A, which contained instant coffee, 48 per cent charac- 
terized the shopper as lazy, and an equal percentage said she did not plan her 
purchases well; 12 per cent called her a spendthrift; and 16 per cent said she 
was not a good wife. In the group shown list B, which contained a well-known 
brand of ground coffee, laziness was mentioned by only 4 per cent and bad 
planning by 12 per cent; no one in this group described the shopper as a 
spendthrift or said she was not a good wife. Adding a pie mix to both shopping 
lists, which were used with two new samples of housewives, brought the 
personality descriptions of the regular-coffee buyer closer to those of the instant- 
coffee buyer. Evidently it was the inclusion of a prepared food item that 
evoked the stereotype. 
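
Because each list was shown to 50 housewives, the percentages above can be converted back to counts and the two groups compared directly. The Python sketch below does this with a pooled two-proportion z test. The test is an illustrative choice made here, not one reported in the source, and with counts this small the normal approximation is only rough.

```python
from math import erfc, sqrt

N_PER_LIST = 50  # housewives shown each list, as stated in the text

# Percentage giving each description: (list A, list B), from the text.
reported = {
    "lazy": (48, 4),
    "does not plan well": (48, 12),
    "spendthrift": (12, 0),
    "not a good wife": (16, 0),
}

def two_proportion_z(x1, n1, x2, n2):
    """Pooled two-proportion z statistic and two-sided normal p-value."""
    pooled = (x1 + x2) / (n1 + n2)
    se = sqrt(pooled * (1 - pooled) * (1 / n1 + 1 / n2))
    z = (x1 / n1 - x2 / n2) / se
    return z, erfc(abs(z) / sqrt(2))

for label, (pct_a, pct_b) in reported.items():
    count_a = round(pct_a * N_PER_LIST / 100)
    count_b = round(pct_b * N_PER_LIST / 100)
    z, p = two_proportion_z(count_a, N_PER_LIST, count_b, N_PER_LIST)
    print(f"{label}: {count_a} vs. {count_b} of {N_PER_LIST}, "
          f"z = {z:.2f}, p = {p:.4f}")
```

For "lazy," for example, the percentages correspond to 24 of 50 respondents for list A against 2 of 50 for list B.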

In still another part of the same study, responses to the original instant-coffee
list were obtained from users and nonusers of instant coffee, as determined
by a pantry check. The personality descriptions revealed several
significant differences between the two groups of housewives. For example, 
70 per cent of users and 18 per cent of nonusers characterized the shopper as 
economical; 29 per cent of users and no nonusers described her as a good 
housewife who planned well and cared about her family; 16 per cent of users 
and 55 per cent of nonusers said she could not cook or did not like to; and 
19 per cent of users and 39 per cent of nonusers called her lazy. 

Similar results were obtained by a simpler method in a Belgian survey re- 
garding the use of soup mixes (19, p. 99). Three groups of housewives, class- 
ified as regular users, infrequent users, and nonusers, were asked, “For which 
among these six listed reasons do you think a woman might use soup mixes?” 
Only two of the answers were chosen with significantly different frequencies 
by the three groups. “She thinks soup mixes are as good as home-made soups” 
was checked by 73 per cent of regular users, 48 per cent of infrequent users, 
and 21 per cent of nonusers. On the other hand, “She is lazy” was the reason 
given by 35 per cent of regular users, 47 per cent of infrequent users, and 60 
per cent of nonusers. It seems that fear of being considered lazy may be 
a common deterrent among nonusers of instant foods. 

The reverse approach was followed in a British study of gasoline (or petrol, 
as it is called in Britain). Motorists were shown photographs of six men, of 
whom two were chosen as young and sporty types, two as solid and respectable 
citizens, and two as depressed and downtrodden types (19, pp. 78-79). Each 
respondent was told, “Here are some photographs of typical motorists. Would 
you look at them and tell me what sort of petrol you would guess that each 
of them habitually uses?” Only 3 per cent of those questioned said they could 
not tell from the photographs. The replies given by the other respondents 
revealed strong associations between certain brands and one or another of the
given types, as can be seen in Table 16. Brand A seems to be more closely
associated with the young and sporty type, Brand B with solid citizens. 


TABLE 16 Stereotypes Associated with Different Brands of Gasoline
(From Henry, 19, p. 79)

                        Percentage ascribed to
Gasoline    Young and    Solid       Depressed,
Brand       sporty       citizens    downtrodden    Total
A           61           29          10             100
B           26           51          23             100
C           34           35          31             100
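
As a small sketch of how Table 16 can be read, the following Python snippet stores the published percentages, checks that each row sums to 100, and reports the type most often ascribed to each brand together with the spread between the largest and smallest percentage. The names `table_16` and `summarize` are illustrative assumptions; only the percentages come from the table.

```python
# Percentages from Table 16 (Henry, 19, p. 79).
table_16 = {
    "A": {"young and sporty": 61, "solid citizens": 29, "depressed, downtrodden": 10},
    "B": {"young and sporty": 26, "solid citizens": 51, "depressed, downtrodden": 23},
    "C": {"young and sporty": 34, "solid citizens": 35, "depressed, downtrodden": 31},
}

def summarize(row):
    """Return the most frequently ascribed type and the max-min spread."""
    top = max(row, key=row.get)
    spread = max(row.values()) - min(row.values())
    return top, spread

for brand, row in table_16.items():
    assert sum(row.values()) == 100  # each row is a complete percentage breakdown
    top, spread = summarize(row)
    print(f"Brand {brand}: {top} ({row[top]} per cent), spread {spread} points")
```

A wide spread, as for Brands A and B, points to a distinct stereotype; a narrow one, as for Brand C, would suggest that respondents did not link the brand with any one type.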


Motivation research can serve as a source of hypotheses, which can sub- 
sequently be tested by objective, quantitative, large-scale procedures. It is thus 
most useful in the preliminary, exploratory stages of a consumer study. As a 
by-product, it can also suggest new advertising appeals to the copywriter. The 
effectiveness of ads prepared on this basis, however, needs to be determined 
empirically, as for any other ads. Even when several techniques are combined 
in the same study—as is often done—motivation research cannot provide con- 
clusive answers about consumer behavior. Because of the many limitations of 
this research, its findings need to be verified by more objective procedures. 

Most motivation research is conducted on small and unrepresentative samples.
Depth interviewing, as well as certain projective techniques, requires excessive
expenditure of time on the part of trained investigators. The cost per
case thus makes these procedures unsuitable for large-scale surveys. At the 
same time, the type of information gathered in motivation research is subject to 
wide individual differences. The emotional associations and motives investi- 
gated by motivation research are often of a highly personal nature and cannot 
be readily generalized. Once certain behavior patterns or feelings have been 
identified, it is necessary to determine what proportion of consumers in large, 
representative samples react in a similar way toward a given product. 

Projective techniques of the sort used in motivation research have generally 
yielded little or no evidence of validity in personality studies. We certainly 
cannot assume that they provide valid indices of buying motives. In this area, 
as in personality testing, their validity remains to be demonstrated. The fact 
that in depth interviewing or projective testing the respondent may talk at 
great length does not necessarily mean that what he says is deep, nor that it is 
relevant to the behavior we are trying to predict.

In visualization, or role-playing, studies, such as those dealing with instant 
coffee and gasoline, we cannot be sure that the associations between product 
use and personality stereotypes are as strong or prevalent as the results suggest, 
nor that they actually affect buying behavior (see 22). When the interviewer 
asks subjects to give a personality description from a shopping list, for instance, 
he is forcing them to hunt around for any stereotype they can think of in the 
effort to comply with his request. Sad as it may be, few respondents have the 
self-confidence and courage to tell the interviewer that it can't be done. It
is thus possible that many of the housewives who described the instant-coffee
buyer as lazy did so only because they thought that was what the interviewer
wanted them to say. The results merely show an awareness of the stereotype. 
They do not indicate its prevalence or strength. The fact that the undesirable 
stereotype was mentioned by more nonusers than users could mean simply that 
respondents are less likely to express disapproval of something that they them- 
selves do. But this does not tell us what is cause and what is effect. 

A common danger in motivation research is that the results may reflect 
the investigator's preconceived notions. Because of the uncontrolled and sub- 
jective nature of some of the techniques, it is virtually impossible to rule out 
the investigator's bias. The farfetched psychoanalytic interpretations offered 
by some practitioners of motivation research have little basis in verifiable facts. 
Under these conditions, too, it sometimes happens that different motivation 
researchers make inconsistent or opposite recommendations regarding the same 
product (see, e.g., 6, pp. 228-229). 

It should also be recognized that motivation research is not new. It repre- 
sents the application to consumer research of techniques that had long been 
in use in clinical psychology. Nor is motivation research the only way to study
buying motives. If used with appropriate experimental designs, direct ques- 
tioning can often yield the required information. Instead of asking a subject 
why he does something—which is a pretty big order anyway—we can ask him 
direct questions about what he does, how he does it, and when. Thus we could 
ask how long the respondent has been using his current brand of dentifrice, 
what other members of the family use it, what he disliked about his former 
brand, and so on. Or we can ask at what times a subject smokes more than 
usual and at what times less than usual, when he enjoys it most and when 
least, etc. (see 50, pp. 213-220).

To be most fruitful, questions should be formulated so as to test specific 
hypotheses. Sometimes the cross tabulation of replies to different questions 
can yield information that could not be obtained from an analysis of single 
questions. A good example can be found in an automobile survey conducted 
by Politz (46). The investigator hypothesized that the average motorist’s judg- 
ment of the “pickup,” or acceleration, of his car may be quite inaccurate and 
may be influenced by an irrelevant cue, namely, the amount of resistance en- 
countered when pressing down the gas pedal. To test this hypothesis, he asked 
motorists two questions. In one part of the interview, the respondent was asked 
whether his car had good pickup. In another, he was asked about the ease of 
handling the car, the gas pedal being mentioned along with steering wheel, 
brake, and other controls. Engineering data about the cars indicated that stiff 
and soft accelerator springs occurred in about the same proportion in car 
models with high acceleration as in those with low acceleration. The survey 
replies showed, however, that, among motorists who reported difficulty in 
pressing and keeping down the accelerator, only 26 per cent said their car had 
good pickup; among those who reported that it was easy to press and keep 
down the accelerator, 61 per cent described their car as having good pickup. 
Softness of accelerator spring was thus perceived as additional horsepower.
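
The cross tabulation described above can be sketched in a few lines of Python. The respondent counts below are hypothetical, chosen only so that the resulting percentages match the 26 and 61 per cent quoted in the text; the survey's actual sample sizes are not given here, and the function names are illustrative.

```python
from collections import Counter

# Hypothetical respondent records: (pedal effort, reports good pickup).
# Counts are invented so that the percentages match those quoted in the text.
records = (
    [("hard", True)] * 26 + [("hard", False)] * 74
    + [("easy", True)] * 122 + [("easy", False)] * 78
)

def cross_tabulate(rows):
    """Count respondents in each (pedal effort, pickup rating) cell."""
    return Counter(rows)

def percent_good_pickup(table, effort):
    """Percentage of respondents with the given pedal effort who report good pickup."""
    good = table[(effort, True)]
    total = good + table[(effort, False)]
    return 100 * good / total

table = cross_tabulate(records)
for effort in ("hard", "easy"):
    print(f"pedal {effort} to press: "
          f"{percent_good_pickup(table, effort):.0f} per cent report good pickup")
```

Neither question alone would reveal the relationship; it appears only when the replies to the two questions are tabulated against each other.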

In commenting upon this procedure, Politz wrote, ". . . the illusion that
the interview has to obtain a ‘true’ answer from a given respondent is dispensed 
with; the answers are reactions and all we need to do is to observe and measure 
them as reactions" (46, p. 122). It is interesting to note that this is the approach
followed in the development and use of most personality inventories
(see 2, pp. 520-521). The questions in such inventories are not designed to
elicit factual information but are treated as verbal stimuli to which subjects 
respond. The extent to which each response correlates with other behavioral 
indices is then empirically investigated. In discussing the construction of the 
Minnesota Multiphasic Personality Inventory, for example, Meehl wrote
(33, p. 9):


. . . the verbal type of personality inventory is not most fruitfully seen as a "self-rating"
or self-description whose value requires the assumption of accuracy on the part
of the testee in his observations of self. Rather is the response to a test item taken as an 
intrinsically interesting segment of verbal behavior, knowledge regarding which may 
be of more value than any knowledge of the "factual" material about which the 
item superficially purports to inquire. Thus if a hypochondriac says that he has 
"many headaches" the fact of interest is that he says this. 


This objective, empirical approach is well worth trying in consumer research as 
well. 


PERSONALITY STUDIES OF CONSUMERS 


Knowledge about the characteristics of various consumer publics has been 
one of the objectives of consumer research from its earliest beginnings. Media 
research provides information on demographic characteristics of the audience 
reached by specific newspapers, magazines, and radio or television programs. 
Armed with this information, the advertiser can choose the medium most 
suitable for his product and can adapt the appeal, wording, and other features 
of his ad to the particular audience. Thus food ads will be more successful if 
placed in a magazine read by housewives. Ads for a single product, such as 
cigarettes, may be very differently expressed for a college paper, a trade journal, 
or a general family magazine. Within one medium, placing book ads on the 
book review page not only selects the persons interested in books but reaches 
them at a time when they are specifically oriented toward books. 

To choose his medium effectively, as well as to prepare a successful ad, the 
advertiser needs to know as much as possible about the consumers who use 
his product. Beginning with such simple demographic variables as sex, age, 
and socioeconomic level, consumer studies have been extended to include 
personality characteristics. Motivation researchers have used depth interviewing 
and projective tests to find out how users of a product or brand differ from 
nonusers (11; 50, pp. 156-158). Other investigators have employed personality
inventories for the same purpose. Personality factors have been studied in 
relation to such consumer variables as motion-picture preferences (49), amount 
of cigarette smoking (14), being overweight and hence a potential buyer of 
reducing products (27), and listening to soap operas (21).

An exploratory study of magazine readership used a special adaptation of
the Allport-Vernon-Lindzey (A-V-L) Study of Values designed to fit the
vocabulary and general information level of readers of mass-circulation maga- 
zines (13). The test was administered to 300 men and women in six cities. 
When subjects were classified according to their expressed interests in different 
types of fiction and nonfiction articles, many significant differences in A-V-L 
score patterns were found. In another study (53), 133 male business college
students were given the Gordon Personal Profile and a disguised questionnaire
containing questions on the use of various products, such as headache remedies, 
cigarettes, chewing gum, and deodorants. A number of low but significant re- 
lations were found between product use and personality test scores. For ex- 
ample, the use of headache remedies correlated negatively with ascendancy and 
emotional stability; the acceptance of new fashions correlated positively with 
ascendancy and sociability. 

An unusually extensive and well-controlled study of consumer personality
was conducted with the nationwide, representative consumer panel of the
J. Walter Thompson Company (25, 26). The Edwards Personal Preference
Schedule (EPPS) was administered to 8,963 adult men and women who were 
classified as heads of households. The EPPS, which was described in Chapter 
3, is designed to measure the relative strength of 15 needs, such as achieve- 
ment, affiliation, dominance, or change. In this study, significant mean dif- 
ferences in several needs were found among subgroups classified according 
to such demographic variables as age, sex, annual income, city size, and United 
States region. Such findings provide the advertiser with useful information 
about consumers in different markets. 

A major part of the analysis consisted in the comparison of EPPS scores of 
consumer subgroups classified according to reported purchase of various com- 
modities, such as cars, cigarettes, cosmetics, and paper towels. Subscribers to 
different magazines were similarly compared. Illustrative results obtained with 
male cigarette smokers and nonsmokers are given in Figure 82. The EPPS
scores are expressed as percentiles based on adult male norms. The five scales
listed on the graph yielded mean differences between nonsmokers and heavy
smokers that were significant at the .01 level. The heavy smokers scored
higher in expressed needs for sex, aggression, and achievement, while the nonsmokers
scored higher in the needs for order and deference, or compliance.
Further breakdown of results showed that smokers of filter cigarettes scored
significantly higher than nonfilter smokers on dominance, change, and achievement;
the nonfilter smokers scored significantly higher on aggression and autonomy,
or the need for independence.

Fig. 82 Mean scores on Edwards Personal Preference Schedule obtained
by male cigarette smokers and nonsmokers. (Adapted from Koponen, 26, p. 9.)

In still another part of the same project, mail order purchases made in re- 
sponse to different appeals were checked against EPPS scores. In each case, 
two groups were chosen from persons who scored at opposite ends of the 
distribution in the need under investigation but who were matched in such 
variables as age, income, and city size. A sales promotion piece was deliberately 
written to appeal to individuals high in that particular need. These advertise- 
ments were mailed to the members of both groups, and the resulting sales in 
the two groups were compared. A mail order ad written around the appeal for 
change, for example, brought in over twice as many returns from the group 
scoring high on the need for change as from the group scoring low in this 
need. In general, these validation studies revealed significant but low relation- 
ships between EPPS scores and buying behavior. Other factors, such as price, 
product characteristics, and nature of the offer proved to be more important in 
determining frequency of returns. To be sure, the influence of personality 
factors on buying behavior may vary with the product. But the data so far 
available suggest that this influence is generally small. 


PRODUCT IMAGE 


The terms “product image” and “product personality” refer to the composite of
feeling tones and ideas associated with a product. Tea, for instance, was tradi-
tionally associated with old ladies, invalids, delicate china, and the affectations
of the “tea shoppe.” In the effort to change this image and to appeal to a wider
public, advertisers began to feature pictures of young people and of rugged-
looking men drinking tea for relaxation after work or sports (see Figure 83).
For the same reason, such terms as “brisk,” “robust,” and “hearty” have been
used more and more to describe tea in advertising copy.

Product image may also refer to a specific brand. Marlboro cigarettes were 
originally considered elegant and somewhat feminine. In the effort to alter 
this brand image, the company undertook an advertising campaign in which 
ranchers, hunters, and other sturdy outdoor types were shown smoking Marl- 
boros and invariably displaying a tattoo mark. The previously cited British 
study indicated that different brands of gasoline have characteristic brand 
personalities. One was associated predominantly with young and sporty types, 
another with solid, respectable citizens.

Brand names and trademarks are often chosen so as to strengthen the desired 
brand image. The Campbell soup kids, Aunt Jemima, and Elsie the Borden 
cow are familiar examples. Of particular interest is the story of Green Giant
peas (see 9, pp. 86-89). A small company in Minnesota, the Minnesota
Valley Canning Company, began to market canned peas grown from a seed
that produced extra large peas. The company chose the brand name Green
Giant and displayed a picture of a smiling green giant on the cans and in all 
advertisements of this product. Soon the green giant began to appear also on 
other canned goods prepared by this company, such as corn, and some of the 
other brand names were changed accordingly. Recognizing the popularity 
and effectiveness of this symbol, the company itself eventually changed its 
name to the Green Giant Company. 

The Green Giant story illustrates a deliberate effort to transform a successful
brand image into a corporate image. The reverse process is more common.
In a well-established company, the strength of the corporate image increases 
public acceptance of each of its products. In some situations, however, it is 
better not to identify all products with a single corporate image. If a company 
makes both breakfast food and fertilizer, it would undoubtedly be desirable to 
have different brand names for the two lines and to keep the brand images 
distinct. 

Even when two very similar products are involved, the use of a single brand 
name may be undesirable when the products are designed for different con- 
sumer publics or when some of their characteristics might conflict. This is 
illustrated by what may be regarded as a postscript to the Green Giant story 
recounted above. The same company is now marketing a type of canned peas 
whose principal feature is that they are small. Typical advertising copy for 
these peas asserts that “Their small size tells you, the minute you open the 
can, that these are not ordinary peas. . . . They have a subtle sweetness . . . 
that rivals the famed petits pois of France” (29). Under these conditions, it 
is understandable that the Green Giant company chose a different trade name 
for these peas. 

Much of what was formerly described under the headings of “public rela- 
tions” and “good will” has now been brought together under the concept of 
“corporate image” or “corporate personality” (see 5). Institutional advertising, 
mentioned in the first section of this chapter, is one of the recognized methods 
of building good will and developing a favorable corporate image. But to be 
effective, a corporate image needs to be much more than merely favorable—it 
must be specifically related to company objectives. A corporate image might, 
for example, suggest scientific advancement, old-fashioned goodness and de- 
pendability, elegance and distinction, or sturdiness and vigor. To combine or 
exchange these images might prove disastrous. A corporation cannot be all 
things to all people. It needs to formulate its objectives clearly and then set 
about developing a corporate image that fits these objectives. Conflicting and 
contradictory impressions tend only to confuse and weaken the corporate image. 

To be sure, a corporation will be somewhat differently perceived by its 
different publics, including stockholders, employees, suppliers, distributors, 
customers, government officials, the press, and the communities in which it 
operates. Although recognizably different, these various corporate images will 
have a common core and will tend to influence each other. If a company’s 
plants spread soot over the townspeople’s homes or pollute their streams, these 
actions become a part of its corporate image. Advertising cannot do the whole 
job. The corporate image is affected by every aspect of a company’s operation, 
from the quality of its products and the nature of its employee relations to the 
appearance of its buildings and grounds and the printing type used on its 
letterheads.

Nearly all techniques of consumer research have been used to investigate the
public images of products, brands, or corporations (57). Not only direct ques- 
tioning, but also depth interviewing, projective techniques, and other pro- 
cedures of motivation research have been employed for this purpose. Motiva- 
tion researchers have, in fact, done much to popularize the terms "product 
personality" and "corporate personality." Two relatively objective techniques 
that are well suited for image research are the Semantic Differential and the 
adjective checklist. 

Originally developed for basic research on the psychology of meaning, the 
Semantic Differential has proved to be a useful tool in a wide variety of fields, 
from clinical psychology to advertising (38). Since the Semantic Differential 
provides a standardized and quantifiable procedure for measuring the connota- 
tions of any concept for the individual, it is particularly applicable to the study 
of product images and related problems (3, 24, 35, 57). Specifically, the respon-
dent rates the product (or brand or company) on a seven-point graphic scale
for each of a series of bipolar attributes, such as strong-weak, warm-cold, active-
passive, friendly-unfriendly. Factorial analyses of the ratings for many different
qualities have led to the identification of three principal factors: evaluative
(e.g., good-bad, valuable-worthless), potency (e.g., strong-weak, large-small),
and activity (e.g., active-passive, fast-slow). The evaluative factor generally
accounts for the largest percentage of the total variance (38). 

For each product investigated by means of the Semantic Differential, we 
can plot a “product profile” giving its mean ratings on the qualities rated. We 
can also find the total product score on each of the three factors or on all 
factors combined. The over-all difference between the profiles of two products 
can be expressed in terms of a composite index (37). In the study of corporate 
images, respondents are often asked to rate the ideal company on the same 
qualities on which they rate specific companies. The actual and ideal images 
can thus be compared by examining the profiles or by computing a total 
difference index. 
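
The profile, factor scores, and difference index just described can be sketched
in a few lines of Python. The ratings below are invented for illustration, and
the grouping of scales under the three factors simply follows the examples
given in the text. The Euclidean distance used here is only one plausible form
of a total difference index; it is an assumption and is not necessarily the
composite index cited in (37).

```python
from math import sqrt
from statistics import mean

# Bipolar scales grouped by factor, following the examples in the text.
FACTORS = {
    "evaluative": ["good-bad", "valuable-worthless"],
    "potency": ["strong-weak", "large-small"],
    "activity": ["active-passive", "fast-slow"],
}
SCALES = [scale for scales in FACTORS.values() for scale in scales]


def profile(ratings):
    """Mean seven-point rating on each scale across respondents."""
    return {s: mean(r[s] for r in ratings) for s in SCALES}


def factor_scores(prof):
    """Average the scale means within each of the three factors."""
    return {f: mean(prof[s] for s in scales) for f, scales in FACTORS.items()}


def difference_index(p, q):
    """Euclidean distance between two profiles (an assumed index)."""
    return sqrt(sum((p[s] - q[s]) ** 2 for s in SCALES))


def respondent(*values):
    """Pair a tuple of ratings with the scale names, in SCALES order."""
    return dict(zip(SCALES, values))


# Hypothetical ratings (7 = first-named pole) from three respondents.
actual = [respondent(6, 5, 4, 3, 5, 4),
          respondent(5, 5, 3, 4, 4, 4),
          respondent(7, 5, 5, 2, 6, 4)]
ideal = [respondent(7, 7, 6, 5, 6, 6),
         respondent(7, 6, 5, 5, 6, 5),
         respondent(7, 5, 7, 5, 6, 7)]

actual_profile = profile(actual)
ideal_profile = profile(ideal)
print(factor_scores(actual_profile))
print(factor_scores(ideal_profile))
print(round(difference_index(actual_profile, ideal_profile), 2))
```

A smaller index means that the company's actual image lies closer to the
respondents' ideal; comparing the two sets of factor scores shows on which
factor the gap is largest.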

The adjective checklist provides a simple basic procedure that has been used 
with a number of variations. Respondents are given a list of adjectives or de- 
scriptive phrases and asked to check those that apply to the particular product 
or company. Some of the variants of this technique are designed chiefly to 
dramatize the procedure and make it more interesting for the respondents 
(see 57). One modification is the chessboard technique, in which each
adjective is printed on an individual square of the “chessboard.” The respondent
checks all the squares that apply to the object he is rating. Another variant 
uses pictures to represent the extremes of each quality to be rated. 

A forced-choice variant of the adjective checklist was employed in an in-
vestigation of the product personality of different cars (54, 55). The first part
of the study was conducted in 1956, before the introduction of the 1957 
models. A list of 108 adjectives was presented to 100 undergraduate men, with 
the instructions that for each adjective they indicate which of three types of 
car owners it described best. The subjects recorded their responses by placing 
a check mark in one of three columns headed “Cadillac,” “Buick,” and
“Chevrolet.” The order of both the adjectives and the cars was rotated to avoid
positional effects. The stereotypes associated with these three cars were clearly
revealed in the responses. Among the traits most frequently attributed to
Cadillac owners were famous, important, and successful. Buick owners were
described in such terms as middle-class, brave, and masculine. Chevrolet
owners were characterized by such adjectives as poor, plain, and simple.
The respondents were also asked to compare Ford, Plymouth, and Chev-
rolet owners by means of the same checklist. Six months later, this procedure
was repeated with a similar group of 100 respondents, about 70 per cent of
whom had participated in the first test session. In the interval, the 1957 
models of these cars had been introduced and vigorously promoted in ad- 
vertising campaigns. Table 17 shows the results obtained on the two occasions. 


TABLE 17 Change in Product Personality as Measured by an Adjective Checklist
(Data from Wells, Goi, and Seader, 55, p. 121)

Traits chosen as most typical of each car owner with frequencies exceeding chance
expectations at .01 level, arranged in order of decreasing frequency


FORD                          PLYMOUTH                    CHEVROLET

1956 Results

masculine      merry          quiet      calm             ordinary
young          loud           careful    sad              fair
powerful       active         slow       thinking         common
good-looking   cool           silent     patient
rough          tall           moral      honest
dangerous      interesting    fat        understanding
strong         sharp          gentle     content
single         popular

1957 Results

dangerous      active         high-class                  small
loud           proud          feminine                    low-class
rough          brave          important                   little
powerful                      rich                        [illegible]
cross                         different                   ordinary
thin                          particular                  practical


The youthful and dashing image of the 1956 Ford owner had become rougher, 
tougher, and less debonair in 1957. The Chevrolet owner appeared rather non- 
descript on both occasions. The most conspicuous change occurred in the image 
of the Plymouth owners. Of the 14 adjectives chosen with greater than chance 
frequency to characterize Plymouth owners in 1956, none remained in 1957. 
The pleasant, nice, and stodgy character associated with the 1956 models had 
now been replaced by a more glamorous and discriminating type of person. 
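
The caption of Table 17 refers to frequencies exceeding chance expectations at
the .01 level. One way to picture such a criterion is a one-sided binomial test,
assuming that under chance each of 100 respondents assigns an adjective to any
one of the three cars with equal probability. Whether the investigators used
this exact test is not stated here, so the sketch below is an assumption, and
the function names are illustrative.

```python
from math import comb


def upper_tail(k, n, p):
    """P(X >= k) for X distributed as Binomial(n, p)."""
    return sum(comb(n, i) * p**i * (1 - p) ** (n - i) for i in range(k, n + 1))


def critical_count(n, p, alpha=0.01):
    """Smallest count whose upper-tail probability falls below alpha."""
    for k in range(n + 1):
        if upper_tail(k, n, p) < alpha:
            return k
    return None


# 100 respondents, three cars, so the chance probability is 1/3 (assumed).
n_respondents = 100
chance = 1 / 3
k = critical_count(n_respondents, chance)
print("chance expectation:", round(n_respondents * chance, 1))
print("minimum count significant at .01:", k)
```

Under these assumptions an adjective would be listed for a car only if the
number of respondents assigning it to that car reached the printed minimum.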

These changes in product personality were in line with the changes that 
had occurred both in the cars and in the advertising. Thus a typical ad for the
1957 Plymouth asserted, “In one flaming moment, Plymouth leaps three full
years ahead—the only car that dares to break the time barrier! . . .
traditionally great engineering brings you the fabulous new Fury 301 V-8
engine . . . revolutionary new Torsion-Aire ride . . . exhilarating sports car
handling . . . new supersafe Total Contact brakes . . . dramatic Flight Sweep
Styling. The car you might have expected in 1960 is at your dealer's now!”
(42). As the investigators aptly comment, “It is hard to believe that the
quiet, careful, slow owner of an old Plymouth would dare go near a car with
a Fury engine” (55, p. 121).


SUMMARY 


The appeal is the primary message of an advertisement, whose purpose is to 
show how a commodity can satisfy some consumer want. Lists of human drives 
can serve only as a very general guide to the advertiser. New drives develop 
as a result of experience. The relative strength of appeals varies for different 
products and for different consumer populations. By providing information 
on how consumers use products and on what they particularly like or dislike 
about different products, consumer surveys help in the selection of effective 
appeals. For a number of commodities, negative appeals may be more effective 
than positive appeals. Negative appeals, however, should be distinguished 
from negative suggestion, which is usually undesirable. Although most ads con- 
tain an appeal, two well-known exceptions are reminder ads and institu- 
tional ads. 

Besides presenting explicit appeals, an advertisement can arouse the desired
feeling tone and associate it with the product in more indirect ways. Since
feeling tones tend to spread beyond their source, any aspect of the ad could 
lead to unfavorable, unsuitable, or conflicting associations with the product. 
Among the factors to be examined for appropriateness of feeling tone are 
illustration, color, printing type and other graphic elements, and language. 
In addition, type that is difficult to read or brand names that are hard to 
pronounce tend to produce frustration, annoyance, and embarrassment. 

Motivation research utilizes depth interviewing and projective techniques
in the study of buying motives. Among its many procedures are several 
varieties of intensive individual interviewing, group interviewing, word associa- 
tion, sentence completion, picture interpretation, cartoon tests modeled after 
the Rosenzweig P-F Study, and role playing, or visualization. The principal 
contribution of motivation research is to provide hypotheses for subsequent 
testing by more objective and quantitative methods applicable to large samples. 
Contrary to the claims of some motivation researchers, direct questioning can 
also yield information on consumer motivation, provided it is incorporated in 
suitable experimental designs. 

Closely related to motivation research are personality studies of consumers. 
Users and nonusers of a product or brand have been compared in personality 
characteristics as well as in demographic variables. Interviewing, projective
techniques, and personality inventories have been employed for this purpose. 
Some exploratory research has also been conducted on the relation between per- 
sonality characteristics and actual buying response to specific appeals. 

The concepts of product image, product personality, and corporate image 
have been popularized by motivation research. Nearly all techniques of con- 
sumer research have been utilized to investigate the public images associated 
with products, brands, or corporations. Two techniques that are often applied 
in this type of study are the Semantic Differential and the adjective checklist.


Perceptual and Cognitive
Factors in Advertising 


Advertising is a form of communication. In Chapter 11, we considered the 
message that the advertiser wishes to communicate, as illustrated by explicit 
appeals, feeling tones and atmosphere conducive to product acceptance, brand 
images, and corporate images. Whatever their objectives, advertising communi- 
cations must be presented in such a way that they will be noticed, understood, 
retained, and accepted. To be sure, these are necessary but not sufficient con- 
ditions for advertising effectiveness. An ad may catch the eye, arouse interest, 
present its arguments clearly and convincingly, and still not sell the product. 
On the other hand, if no one notices an ad, it cannot affect sales no matter 
how meritorious it is in all other respects. 


MECHANICAL ASPECTS OF PRINTED ADS 


Some of the earliest studies in the psychology of advertising were concerned 
with the effect of such ad characteristics as size, repetition, position, color, and 
illustration upon attention and retention. Continued research on these factors 
by a variety of methods has led to the accumulation of a considerable body of 
data that can serve as a general guide in the preparation of ads and in the 
planning of advertising campaigns. Detailed surveys of such data can be 
found in several sources (23, chs. 20-31; 29). Recent results have been derived 
largely from readership surveys of magazines or newspapers and from analyses 
of written inquiries, although eye cameras, tachistoscopes, and other laboratory 
techniques have also been employed.

Readership Surveys. A particularly well-controlled study of readership 
data was published by Rudolph (40) in 1947. Using Starch readership figures 
for 2,500 half-page and full-page Saturday Evening Post ads, Rudolph analyzed 
the effect of each ad characteristic while holding other features constant. Ads 
were regularly matched for product, company, size, and use of color, unless one 
of the latter variables was itself under investigation. For example, when study- 
ing the readership of right- and left-hand pages, Rudolph compared Texaco 
black-and-white full-page ads appearing on right and left pages. Readership 
ratios found in such subgroup comparisons were then averaged for all ads. 
Rudolph’s findings have been corroborated by later Starch studies of more 
limited scope (see 44-46, 48, 51, 52) as well as by readership studies conducted 
by others. Some of the major results from these studies will be cited for 
illustrative purposes. 
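
The logic of this matched comparison can be sketched as follows. Within each
subgroup of ads that are alike in product, company, size, and color, a
readership ratio is computed for the one variable under study, and the ratios
are then averaged. The percentages and advertiser labels below are hypothetical,
and the function name is illustrative; only the procedure follows the text.

```python
from statistics import mean

# Hypothetical "noted" percentages for matched ads. Each row stands for ads
# alike in product, company, size, and color, differing only in page position.
matched_subgroups = [
    {"advertiser": "A", "right_page": 42.0, "left_page": 40.0},
    {"advertiser": "B", "right_page": 18.0, "left_page": 20.0},
    {"advertiser": "C", "right_page": 31.0, "left_page": 30.0},
]


def average_ratio(subgroups, condition, baseline):
    """Average the within-subgroup readership ratios (condition / baseline)."""
    return mean(g[condition] / g[baseline] for g in subgroups)


ratio = average_ratio(matched_subgroups, "right_page", "left_page")
print(round(ratio, 3))
```

Because each ratio is formed inside a matched subgroup, differences between
products or advertisers cannot masquerade as an effect of page position.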

Increasing the size of an ad will generally increase its readership, but not
in direct proportion to size. An ad that is doubled in size will have somewhat
less than double the original readership. Moreover, the rate of increase in
readership will itself depend upon the attention value of the original ad. A 
small ad that is noted by a large proportion of readers has less to gain by an 
increase in size than one noted by few readers. 

Starch (46) has pointed out that, by chance alone, if 10 out of 100 readers
notice a half-page ad, then doubling the size of the ad should attract 10 per
cent of the remaining 90 readers, or 9 additional readers. Hence the full-page
ad should attract 19, not 20, readers. On this basis, Starch proposes the
following formula for predicting the effect of doubling ad size upon readership:


x = N + (100 - N)(.01N)


in which N is the percentage of readers who noted an ad of given size and x
is the predicted percentage of those who will note an ad of twice the size. In
simplified form, the formula reduces to:

x = 2N - .01N²

Table 18 shows that the percentages predicted with this formula agreed quite
closely with those obtained empirically. The data are based on 6,150 half-page
and 6,604 full-page ads surveyed by Starch over a four-year period. Readership
scores were compared for half-page and full-page black-and-white ads in the
same product category and appearing in the same magazine. The same analysis
was repeated with two-color and with four-color ads.


TABLE 18 Relation between Ad Size and Readership
(From Starch, 46)

                              Percentage “Noted”
                       Obtained with   Obtained with   Predicted with
Category               half page       full page       full page

Black-and-white ads
  Men                  9.6             18.5            18.3
  Women                10.1            20.6            19.2
Two-color ads
  Men                  13.0            23.9            24.3
  Women                6.3             11.6            12.2
Four-color ads
  Men                  13.7            26.9            25.5
  Women                25.4            44.9            44.3
Total                  13.3            25.4            24.8
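
As a check on the arithmetic, the formula can be applied to the half-page
figures of Table 18. The short Python sketch below does so; the function and
variable names are illustrative, and the half-page figure for women in
four-color ads is taken to be 25.4 per cent, the value consistent with the
printed prediction.

```python
def predicted_full_page(n):
    """Starch's prediction x = N + (100 - N)(.01N), with N in per cent."""
    return n + (100 - n) * 0.01 * n


# (category, half-page "noted", full-page "noted") as given in Table 18.
table_18 = [
    ("Black-and-white, men", 9.6, 18.5),
    ("Black-and-white, women", 10.1, 20.6),
    ("Two-color, men", 13.0, 23.9),
    ("Two-color, women", 6.3, 11.6),
    ("Four-color, men", 13.7, 26.9),
    ("Four-color, women", 25.4, 44.9),
    ("Total", 13.3, 25.4),
]

for category, half, full in table_18:
    x = predicted_full_page(half)
    print(f"{category:24s} obtained {full:5.1f}  predicted {x:5.1f}")
```

Rounded to one decimal place, the computed values reproduce the "predicted"
column of the table, and each falls somewhat short of twice the half-page
figure, as the argument requires.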

Size of ads has been analyzed from several other points of view. One analysis 
of newspaper ad readership, for example, indicated that larger ads attract a 
greater proportion of nonusers than do smaller ads (21). This relationship is 
illustrated in Figure 84. Present users are likely to notice an ad for the product 
they use, regardless of ad size, because of their interest in the product itself. 
The nonuser, on the other hand, is more likely to respond to the mechanical 
factor of size. Small ads may thus be relatively effective in retaining present
users and possibly inducing them to buy more of the product, while large
ads may be more effective in attracting new users.

Fig. 84 Relation of size of advertisement to readership by users and
nonusers of product. (From Hadley, 21, p. 55.)

The additional space available in a large ad may be used in various ways. 
It is possible, of course, merely to magnify a small ad so as to fill up the 
space; but this is rarely done. In a larger ad, there is an opportunity to
include more illustration and more text. Sometimes, the additional space is
utilized simply as “white space,” or unfilled background. Ads with a large 
proportion of white space tend to attract attention because of their novelty, 
stark simplicity, and startling appearance. White space may also be used to 
create a desired atmosphere. The ad reproduced in Figure 85 illustrates a 
liberal use of blank space in combination with an arresting headline to arouse 
interest and curiosity. 

A second major factor that increases the number of readers reached by an 
ad is repetition. Each time an ad is repeated in a different issue of a magazine, 
it will be seen by a certain number of new readers, as well as by some who 
had seen it in the earlier issue. As a result, the total number of readers who 
notice the ad as well as the number who read it tends to remain approximately 
the same on successive repetitions. Moreover, the repeated exposure of those 
readers who notice the ad on more than one occasion serves to strengthen 
whatever impact the ad makes on them.

Starch (48) analyzed the readership data for 80 ads that had been inserted
from two to eight times unchanged in different issues of the same magazine. 
The intervals between insertions ranged from two weeks to several months, 
the most common interval being one month. The results showed that each ad 
tended to retain the same readership level through all repetitions. Those ads 
with low initial readership were noted by a small proportion of readers on 
successive repetitions; those with high initial readership were noted by a large
proportion of readers on each repetition. Such findings suggest the desirability
of using successful ads repeatedly. Similar results have been obtained in other
studies with business magazines (32) and with outdoor poster advertising (49).

Fig. 85 Use of white space as an attention-getting device.
(Courtesy of Rogers Corporation.)

A somewhat different procedure was followed in a well-designed experiment 
conducted with 150 Saturday Evening Post subscribers in a single city (37).
By means of specially prepared copies of one issue, some readers were exposed 
to certain ads twice, some once, and some not at all. Each reader underwent 
all three conditions, but with different ads. As a check on actual exposure to 
the ad, a tiny glue seal held the test pages together in each copy. Only when 
this seal was broken was the exposure counted for that reader. 

A few days after the last exposure, all subjects were asked to name the first
brand that came to mind for various products. It was thus possible to determine
how far exposure to an ad had raised the percentage of respondents who named 
a given brand above the percentage who named it in the unexposed control 
group. Reference to Figure 86 shows that, in terms of this index of brand 
familiarity, two exposures were about twice as effective as a single exposure. 
The advantage of two exposures proved to be equally large when subjects were 
questioned about their knowledge of and belief in the claims made by the 
different brands and when they were asked what brand they thought they 
would buy if they were to purchase the product on that day. Results pertaining 
to the respondent's willingness to buy the advertised product are also illustrated 
in Figure 86. 
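
The index used in this experiment is simply the gain in the percentage naming a
brand over the percentage found in the unexposed control condition. A minimal
Python sketch follows. The percentages are invented for illustration and are
not the values plotted in Figure 86; the function name is likewise illustrative.

```python
def gain_over_control(exposed_pct, control_pct):
    """Percentage-point gain of an exposed group over the unexposed control."""
    return exposed_pct - control_pct


# Hypothetical percentages of respondents naming the advertised brand first.
named_brand_pct = {"no exposure": 12.0, "one exposure": 15.0, "two exposures": 18.5}

control = named_brand_pct["no exposure"]
gain_once = gain_over_control(named_brand_pct["one exposure"], control)
gain_twice = gain_over_control(named_brand_pct["two exposures"], control)

print("gain after one exposure:", gain_once)
print("gain after two exposures:", gain_twice)
print("ratio of gains:", round(gain_twice / gain_once, 2))
```

A ratio of gains near 2 would correspond to the finding that two exposures were
about twice as effective as one.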

Considerable attention has been given to the position of an ad within 
magazines or newspapers. This question actually covers several independent 
questions. One has to do with choice of page within the medium. Extensive 
data have been gathered on the proportion of readers who look at each page in 
a newspaper or magazine. The results indicate that the content of different 
pages has more effect on reader traffic than does the physical position of the 
page. Placing an ad near a popular editorial feature gives it an advantage. 

Fig. 86 Effect of repetition of magazine advertisements upon readers’
reactions to advertised brands. (From Politz, 37, pp. 14, 20; reproduced
by permission of Curtis Publishing Company.)

In magazines, ads on the cover pages (inside front cover, page 1, inside and
outside back cover) do have a significant readership advantage of 30 to 64 per
cent over inside pages (52). Among the inside pages, however, physical loca-
tion seems to make little difference (51, 52). This uniformity of attention 
value probably results from the increasing efforts of magazine publishers to 
distribute the editorial content evenly throughout the medium. Whenever possi- 
ble, of course, ads should be placed near relevant editorial features. Thus 
food ads should be on or near the woman's page, book ads on or near the book 
review pages, and so on. Such placement helps to select an interested audience 
and also reaches the reader when he is particularly receptive to the product. 

Advertisers have also been much concerned about right- and left-hand pages, 
as well as right and left halves of single pages. Readership data, however, have 
shown negligible differences between these positions when comparisons are 
made under adequately controlled conditions (40). What differences are 
found probably result from a variety of factors, including reading habits (from 
left to right in our culture), habits of holding and turning pages, and cus- 
tomary placement of ads and features. For instance, in going through a maga- 
zine from front to back, the left-hand page is in motion when first seen and 
hence will not be perceived as clearly as the right. A number of readers, how- 
ever, have the habit of “thumbing through” a magazine from back to front, a 
practice that reverses the advantage of right and left pages. 

Color is now employed in the majority of magazine ads, and it is rapidly 
being introduced in newspaper ads as well. How effective is such a use of color 
in attracting reader attention? In Rudolph's analysis of readership data for 
Saturday Evening Post ads, two-color ads, which use only one color besides 
black, had only about 1 per cent higher readership than comparable black-and-
white ads (40, pp. 35-37). Since the additional cost of the two-color ads
averaged about 17 per cent, this use of color did not seem justified. However, 
four-color ads, in which all colors can be realistically reproduced, averaged 54 
per cent higher than black-and-white ads in readership figures and only 44 per 
cent higher in cost. Today this cost differential has been reduced in some 
media, a condition that increases the net advantage of colored ads. 
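
The comparison of readership gain with added cost can be made explicit by
expressing each type of ad as readership per unit of cost, relative to a
black-and-white ad taken as 1.00. The sketch below uses only the percentages
quoted from Rudolph's analysis; the function name is illustrative, and treating
the ratio of the two increases as the measure of value is a simplifying
assumption.

```python
def relative_readers_per_dollar(readership_gain_pct, cost_gain_pct):
    """Readership per unit cost relative to a black-and-white ad (= 1.00)."""
    return (100 + readership_gain_pct) / (100 + cost_gain_pct)


# Figures quoted in the text: gain in readership and in cost, in per cent.
two_color = relative_readers_per_dollar(1, 17)
four_color = relative_readers_per_dollar(54, 44)

print(round(two_color, 2), round(four_color, 2))  # prints 0.86 1.07
```

On these figures a two-color ad delivered about 14 per cent fewer readers per
dollar than a black-and-white ad, while a four-color ad delivered about 7 per
cent more.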

In a later Starch analysis of 3,819 ads in two magazines, two-color ads again 
showed no advantage over black-and-white ads in readership; but four-color 
ads surpassed the black-and-white ads by 85 per cent for half-page ads and by 
about 50 per cent for one-page and two-page ads (45). The latter difference 
probably reflects the ceiling effect discussed earlier in connection with size 
increments. An ad that is already attracting a large proportion of readers is 
not likely to show marked gains in readership as further improvements are 
introduced. In an analysis of reader-impression data for the same ads, however, 
Starch found that color greatly augments the prestige value of the one- and 
two-page ads. The four-color ads were mentioned over 2½ times as often as the
black-and-white as having outstandingly impressed the reader. 

It might be noted that in two-color ads color is generally used not for 
realistic portrayal but as an attention-getting device. When the printed matter 
in a medium is largely black and white, the introduction of even a modicum 
of color will catch the eye. This advantage is lost, however, in a medium which 
uses color as lavishly as is done in most present-day American magazines. 
Where color is still a novelty, as in newspapers, the addition of even a single 
color to an ad will greatly enhance its attention value. In one newspaper sur-
vey, split-run tests of identical ads in black-and-white and with small
quantities of one color added showed substantial readership differences in favor of
the colored ads (4).

Much depends, too, upon the way color is used in an ad. Some ads can be 
effectively presented without color and might gain little from the arbitrary 
addition of one or more colors. Others depend upon color to achieve realism, 
to display the attractive features of the product, or to create a suitable at- 
mosphere. Average readership figures may be misleading when applied in- 
discriminately to individual ads. The introduction of color may produce 
spectacular gains in the effectiveness of some ads and negligible gains in others. 
If used injudiciously, moreover, color may render an ad worse than useless. 

Like color, illustrations have been used with increasing frequency in ad- 
vertising (26). Like color, too, illustrations serve many functions. They may 
be used to facilitate comprehension of ideas, to aid in the later identification 
of the product at the point of sale, or to arouse appropriate feeling tones. For 
certain classes of products, the illustration lets the product “speak for itself,” 
as in the photograph of a delicious cake, a luxurious fur coat, or a decorative 
floor covering. 

With regard to attention value, pictures of people generally rate high. An 
analysis conducted by Starch (43) indicated that ads portraying people were 
more often read than those showing only the product. Using the read-most 
index, which is based on the proportion of respondents who have read half or 
more of the text of an ad, Starch selected the 50 most widely read and the 
50 least widely read ads from his records. The illustrations centered about 
people in 29 of the former and only 10 of the latter. Pictures of the product 
alone were found in none of the former and in 32 of the latter. 

On the other hand, it should be noted that some of the most successful ads 
portray appetizing foods that can be prepared with the given product. This 
type of ad would feature, for example, not a package of flour, but a cake baked 
with that flour, along with a recipe for the cake. The Carnation Evaporated 
Milk ad reproduced in Figure 87 is an example of such an ad. When published 
in Life magazine, this full-page four-color ad received the following Starch 
ratings for women: noted 66 per cent, seen-associated 63 per cent, and read 
most 29 per cent. In all three scores, it ranked first among women in 
that issue of Life. The ad also obtained an unusually high score when evalu- 
ated in terms of the Gallup-Robinson aided-recall technique (see Chapter 10). 

Studies of the relative attention value of photographs and drawings (or 
"artwork") generally reveal a slight advantage of photographs (59). The 
difference, however, is not so large today as it once was, since current artwork 
is often so realistic as to be virtually indistinguishable from photographs. The 
findings of readership surveys were corroborated in a well-designed experi- 
mental study in which photographs and hand-drawn versions of the same ads 
were compared (61). A representative sample of 962 adults in New York 
City were asked to rank the ads for liking and believability. A recall test was 
also administered immediately after presentation of the ads. All three measures 
tended to favor the photographic versions. The one clear-cut exception was an 
ad containing a semihumorous illustration that could be better expressed in 
cartoon form. 

An important consideration in the choice of illustration is its relevance to the 
product, the text, and the appeal. In the use of illustrations to attract attention, 
there is the danger that the ad comes to be regarded as an end in itself. The 
advertiser should always remember that he is trying to sell the product, not 
the ad. Irrelevant, bizarre, or “arty” illustrations may attract attention to them- 
selves but usually fail to associate the ad with the product or brand name 
(28, 44, 50). 

Fig. 87 Effective use of illustration in a high-readership food ad. 
(Courtesy of Carnation Company.) 

A related question pertains to the specific audience that the advertiser wants 
to reach. The content and form of the illustration tend to select those who will 
read the ad. Advertisements containing pictures of women, for example, are 
more often read by women; those with pictures of men are more often read by 
men (40, p. 73). Although most men enjoy looking at a picture of a pretty girl, 
when they see such a picture in an ad they evidently assume that the ad is 
directed to women. Hence they are unlikely to read further. Women react in 
a similar fashion to ads featuring men. 

Factor Analysis of Ad Variables. The potential usefulness of factor 
analysis in investigating the effectiveness of different features of advertisements 
was demonstrated in a study conducted with ads from a single issue of the 
American Builder, a monthly trade magazine (58). As in the studies discussed 
in the preceding section, the criterion of ad effectiveness was readership. The 
same technique could, however, be utilized with inquiries, sales records, or any 
other available criterion. The readership index employed in this study was the 
percentage of readers who recalled seeing any part of the ad. This is similar 
to the Starch “noted” score. The analysis was based on 137 ads, ranging in 
size from one-eighth of a page to four pages. Correlations were computed be- 
tween the readership criterion and 34 mechanical and content variables of 
the ads. Of these variables, 19 were chosen for factor analysis because of 
significant criterion correlations, satisfactory rater reliability, and experimental 
independence. 

The factor analysis yielded six factors, the first two of which accounted for 
more than half of the variance of readership scores. The first factor, accounting 
for 41 per cent of the readership variance of the ads, was described as a pic- 
torial-color factor. Its three highest loadings on ad characteristics were found on 
“square inches of illustration,” “number of pictures showing the product in 
use,” and “number of colors.” The second factor, accounting for an additional 
12 per cent of readership variance, was identified as a size factor. 

On the basis of these factorial results, the investigator worked out a re- 
gression equation for predicting ad readership from size of ad, number of 
colors, and square inches of illustration. For purposes of cross validation, this 
regression equation was applied to the readership scores of ads in six other trade 
and business magazines obtained by the same method with other readers. 
Correlations between the obtained and predicted readership scores ranged from 
.58 to .80 for different magazines, with a mean of .71. Although these results 
are limited to the type of magazines and readers included in the study, they 
indicate considerable validity for the three rather simple predictors employed. 

Analysis of Written Inquiries. The criterion of written inquiries has been 
employed to investigate the effectiveness of the major mechanical characteris- 
tics of ads, with results very similar to those obtained in readership studies. 
Several large-scale analyses of written inquiries have been published (39, id 
47). In the most recent and most extensive of these studies, Starch (47) ob- 
tained from 75 companies confidential records covering 8,200 ads. 
The total number of inquiries analyzed was approximately 12 million, dis- 
tributed over a 10-year period. As in the previously described analyses of 
readership records, the ads compared in studying any one variable, such as 
color, were matched as closely as possible in other variables, such as size, 
kind of product, company, kind of offer, and presence or absence of coupon. 
Differences in size of circulation among magazines were controlled by ex- 
pressing number of inquiries in terms of per million circulation. 
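
The circulation control amounts to a simple rate calculation. The following 
Python sketch shows the arithmetic; the function name and the two example 
magazines are hypothetical and serve only to illustrate why raw inquiry counts 
cannot be compared directly. 

```python
def inquiries_per_million(inquiries: int, circulation: int) -> float:
    """Express an ad's inquiries as a rate per million copies circulated."""
    if circulation <= 0:
        raise ValueError("circulation must be positive")
    return inquiries * 1_000_000 / circulation


# Hypothetical figures: the large magazine draws more inquiries in absolute
# terms, but the small magazine draws more per million circulation.
large_magazine = inquiries_per_million(1200, 4_000_000)
small_magazine = inquiries_per_million(450, 750_000)
print(large_magazine, small_magazine)
```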

The results with regard to size of ad, color, and position are essentially the 
same as those obtained in readership studies. The role of illustration was not 
analyzed. The only significant differences between the two types of criteria 
were found in the case of repetition. Although readership scores remain 
virtually the same when an ad is reinserted in different issues, inquiries do 
decline. The drop for the second insertion is about 25 per cent. There are 
smaller declines for successive insertions, the returns leveling off at about 60 
per cent of the original number. Such a decline is understandable, since 
readers who send a written request in response to an ad are unlikely to write 
again when they see the ad a second time. Thus the later returns come largely 
from new readers. 

A comparison of the results of this investigation with those obtained through 
a similar analysis of written inquiries conducted by Starch some 30 years 
earlier is also of interest. With only one major exception, the conclusions 
drawn from the earlier study were corroborated by the later study. The excep- 
tion pertains to the effect of position of the ad on number of inquiries. The 
earlier study, completed in 1930, found substantial differences in relation to 
position. Ads within the first 5 per cent of pages at the front and within the 
last 5 per cent of pages at the back elicited about 40 per cent more inquiries 
than did ads located in the middle of the issue. This advantage had disappeared 
at the time of the second survey. Starch observes that in the intervening 30 
years magazines have deliberately sought to change their makeup so as to 
distribute reader interest evenly throughout their issues. The inquiry results 
suggest that they have largely succeeded in these efforts. 

Laboratory Techniques. The direct observation of eye movements while 
subjects look at ads was one of the earliest techniques employed in psycho- 
logical studies of advertising. Later investigations used eye cameras to obtain 
the same kind of data more precisely. Tachistoscopes have also been employed 
to determine what the subject sees on the page during his first very rapid 
glance. Immediately after each exposure, subjects are asked what they saw. 
Since the exposure is usually too brief to permit eye movements, the subjects 
will see only that which first catches the eye. By this method one can de- 
termine whether the objects seen first tend to occupy certain positions on the 
page, or to be colored, or to have definite shapes, and so forth. 

Tachistoscopic presentation can be adapted for group testing by means of 
motion pictures or slide projectors. One study demonstrated a close correspond- 
ence between results obtained through a group tachistoscopic procedure and 
those obtained through the more laborious and time-consuming individual use 
of the eye camera (56). Every possible pair of 10 ads was exposed for a half 
second to a total of 154 subjects. Immediately after the presentation of each 
pair, the subjects were asked to indicate on a record sheet which of the two 
ads they would look at if they were given a second look. The Purdue Eye 
Camera was employed in a second test of the same 10 ads with 36 of the 
original subjects. Mean number of preferences for each ad, as determined in 
the group tachistoscopic test, correlated .79 with number of “first looks” re- 
corded for each ad by the eye camera; the correlation of mean preference with 
number of seconds spent looking at the ad was .83. When corrected for un- 
reliability of the eye camera measures, these correlations rose to .86 and .99. 
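
The correction mentioned here can be illustrated with the standard formula 
for attenuation, in which the obtained correlation is divided by the square 
root of the reliability of the unreliable measure. The sketch below assumes 
that only the eye camera measures were corrected, as the wording suggests; the 
reliabilities themselves are not reported in the text, so the code works back- 
ward from the obtained and corrected correlations to the values they would 
imply. 

```python
import math


def correct_for_attenuation(r_xy: float, criterion_reliability: float) -> float:
    """Correlation corrected for unreliability in one of the two measures."""
    return r_xy / math.sqrt(criterion_reliability)


def implied_reliability(r_xy: float, r_corrected: float) -> float:
    """Reliability that would turn r_xy into r_corrected under this formula."""
    return (r_xy / r_corrected) ** 2


# Obtained and corrected correlations as reported in the text.
rel_first_looks = implied_reliability(0.79, 0.86)
rel_seconds = implied_reliability(0.83, 0.99)

# Under the single-measure assumption, these come to about .84 and .70.
print(round(rel_first_looks, 2), round(rel_seconds, 2))
```

If both measures had been corrected, the denominator would instead be the 
square root of the product of the two reliabilities, and the implied values 
would differ. 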

Of particular interest is the methodology developed in a recent study of the 
"visual efficiency" of printed ads, conducted under the joint sponsorship of the 
Advertising Research Foundation and the Du Pont Company (3, 41). Visual 
efficiency of ads was investigated by means of an instrument developed for the 
purpose by HRB-Singer, Inc. A portable version of this device, now available 
for general use, is illustrated in Figure 88. Known as VISTA (Visual Testing 
Apparatus), the instrument provides four measures of the visual efficiency of 
ads. Three of these measures are concerned with the distance, illumination 
level, and duration of exposure at which an ad can be correctly recognized. 
The fourth is based on a test of binocular rivalry, in which two ads are 
presented simultaneously, one to each eye, to determine which one the 
subject perceives. 

Against the four criterion measures of visual efficiency, the investigators 
checked 47 physical or mechanical ad variables. Among them were such 
traditional variables as area, color, and percentage of white space. Also in- 
cluded were two new sets of variables: one was concerned with the division 
of an ad into its component parts, or visual units; the other, with the amount 
of light reflected from the ad and its parts, from which ratios of brightness 
contrast could be computed. 

Fig. 88 Visual Testing Apparatus (VISTA) for investigating visual 
efficiency of advertisements in terms of exposure time, distance, illumination, 
and binocular rivalry. (Courtesy of HRB-Singer, Inc.) 

Criterion measures of the visual efficiency of ads were obtained from 
subjects. Of the 47 ad variables studied, 22 yielded significant correlations 
with one or more of the visual criteria. Multiple correlations of the six most 
predictive variables with each of the criterion measures ranged from .73 to 
.81. A type of cross validation was next carried out with specially constructed 
ads. On the basis of the previously identified ad variables, two high-scoring and 
two low-scoring ads were prepared for each of the four visual criteria. 

Figure 89 shows an ad designed to score high and one designed to score 
low in the binocular-rivalry criterion. To obtain a high score in terms of this 
criterion, an ad should contain few perceptual units, should have an illustration 
low in reflectance, and should be in color. Its message should be stated directly 
in the headline, and the ad area should comprise approximately 10 per cent 
copy, 20 per cent illustration, and 70 per cent white space. When the specially 
constructed ads were tried out on t1 subjects, significant correspondences be- 
tween predicted and actual visual effectiveness were found for most ads, al- 
though not all four criteria could be satisfactorily predicted. 

This study was unusually well designed and offers several promising method- 
ological leads. Much more information is needed, however, before its con- 
tribution can be assessed. The predictive value of the ad variables investigated 
needs to be checked with other types of subjects, other types of advertisements, 
and especially with other criteria of advertising effectiveness. To be recognized 
quickly, at a distance, and under weak illumination is undoubtedly an asset for 
ads appearing on billboards or car cards. It remains to be determined, however, 
whether this kind of recognizability is a major factor in magazine and news- 
paper advertising. A unified illustration, an informative headline, and an abun- 
dance of white space are probably desirable in most display ads that receive 
only cursory inspection. Yet the highly cluttered full-page ads run in news- 
papers by some drugstores and supermarkets are undoubtedly successful. These 
ads are carefully perused by readers, who use them like a catalogue in their 
search for items on sale at bargain prices. These two contrasting types of ads 
illustrate the fact that the effectiveness of different ad variables must be evalu- 
ated in the light of the objectives of the particular ad. 

Fig. 89 Two ads designed to score high and low in visual effectiveness 
as measured by binocular rivalry. (From Snyder and Stover, 41, p. 16.) 


TRADEMARKS AND OTHER REDUCED CUES 


Reduced Cues in Advertising. From a psychological standpoint, every 
trademark is a reduced cue. Its function is to serve as a stimulus for the recall 
of the product and company name. The trademark also tends to evoke the 
feelings and ideas that have become associated with the product through 
previous advertising and through the consumer’s own contacts with the prod- 
uct. When the trademark is a direct representation of the brand name, as with 
Camel, Raleigh, Aunt Jemima, and Smith Brothers, the association is par- 
ticularly easy to establish and the chances of confusion with other products or 
brands are minimized. Frequently, the trademark is chosen so as to symbolize 
a characteristic advantage of the product or service, which the advertiser wants 
to highlight. Examples are Bon Ami’s newly hatched chick (“hasn't scratched 
yet”) and the Prudential Life Insurance Company's Rock of Gibraltar (“the 
strength of Gibraltar”). 

It is not only the registered trademark that serves as a reduced cue. Any 
distinctive recurring feature of a company’s advertising comes to stand for 
the product. An example of such a recurring element is the slogan. Some 
slogans, however, may not be closely associated with the brand name, even 
though they have strong attention value and are easily remembered in them- 
selves. When subjects are given a list of slogans and asked to write the name 
of the company or brand that uses each, a sizable percentage give the wrong 
brand names for certain slogans. The surest way to avoid this confusion is to 
incorporate the brand name in the slogan, for example, “If it's Borden’s, it's 
got to be good” or “I'd walk a mile for a Camel.” When the brand name is 
not in the slogan, the advertiser must make certain that it is repeatedly and 
conspicuously linked with the slogan in his advertising. 

Any other recurrent feature of a company's advertising can likewise func- 
tion as a reduced cue. Characteristic color combinations, printing type, layout, 
writing style, decorative design, or nature of illustration can all become asso- 
ciated with a specific product. When a certain style of advertising has become 
strongly identified with one company, any attempt by a competitor to imitate it 
generally benefits the original company more than it benefits the imitator. Most 
observers attribute the ad to the original company, not even noticing the com- 
petitor’s name. The effect will be a greater rise in sales for the original com- 
pany than for the competitor (see 23, p. 449). 


Repetition with Variation. To achieve continuity while avoiding mo- 
notony, advertisers often combine repetition of a major feature with variation 
of details. This type of advertising frequently takes the form of variations on 
a basic theme. Thus individual ads in a series may illustrate the same appeal 
by means of different incidents. Each ad would thus present the same message 
but express it differently. Or the same character may be portrayed in different 
settings. The use of familiar "trade characters" who recur in a company's ad- 
vertisements not only facilitates product recognition but also helps to provide 
human interest and encourages the consumer to identify with the product. 
Successful examples of such trade characters include the Campbell kids (Fig- 
ure 90), Elsie the Borden cow, Betty Crocker, and the man with the eye 
patch in the Hathaway shirt ads (Figure 91). 

Testing the Effectiveness of Trademarks and Trade Names. Psycho- 
logical research on the effectiveness of trademarks and trade names, or brand 
names, has been concerned with several different questions. One investigation 
undertook to measure the salience, meaning, and memory value of trademarks 
(5). Using a total of 166 subjects, the investigators compared the trademarks 
employed for the same product category by seven competing companies. By 
"salience," they meant the readiness with which the trademark could be seen 
and recognized. This characteristic was studied through successive tachisto- 
scopic exposures of each trademark, which increased progressively from 1/50th 
of a second to 1 second. 

Meaning was evaluated through the Semantic Differential (see Chapter 11, 
p. 306). A meaning index for each trademark was derived from the ratings 
it received on the good-bad, strong-weak, and active-passive scales. Memory was 
tested by exposing a single slide containing the seven trademarks for 15 seconds 
and, immediately upon its removal, asking subjects to write down all trade- 
marks they recalled seeing. The order of the trademarks was rotated for 
different subjects to avoid positional effects. A memory score was computed 
for each trademark on the basis of the number of subjects who named it first, 
the number who named it second, and so on. When not named at all by a 
subject, it was scored zero for that subject. Apart from the methodological 
contributions of this study, a noteworthy finding concerns the high inter- 
correlations among the three indices of trademark effectiveness. These correla- 
tions ranged from .75 to .82 for the seven trademarks. In other words, 
trademarks that were readily perceived and that created a favorable feeling 
tone tended also to be recalled first. 

The free association technique was used in a study of the stimulus value 
of trade names (18). Three equated groups of 100 students each were given 
the stimulus words Coca-Cola, Pepsi-Cola, and Dixie-Cola, respectively, with 
the instructions to write down the first word that came to mind. The associa- 
tions showed considerable clustering, the large majority of subjects responding 
with one of a small number of words. The most frequent total association 
(regardless of stimulus) was "drink," given by 40 subjects for Pepsi-Cola, 32 
for Coca-Cola, and 11 for Dixie-Cola. The predominance of this response to 
Pepsi-Cola probably resulted from the use of the word "drink" in a Pepsi-Cola 
advertising jingle current at the time. On the other hand, it is noteworthy 
that despite the Coca-Cola slogan, “Thirst knows no season,” only two subjects 
gave the response “thirst” to the stimulus “Coca-Cola.” 

Fig. 91 Repetition with variation in a series of ads using the same trade 
character. (Courtesy of C. F. Hathaway Co.) 

The second most frequent response was “Coca-Cola,” given by 29 subjects 
to the stimulus “Dixie-Cola” and by 22 subjects to the stimulus “Pepsi-Cola.” 
The third most frequent response was “Pepsi-Cola,” given by 29 subjects to 
“Dixie-Cola” and by 10 subjects to “Coca-Cola.” The frequency with which 
a less familiar brand name elicited a more familiar brand name in the same 
product category is of particular interest. 
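
The response frequencies reported above can be tallied to confirm the rank 
order of the three most common associations. The following Python sketch uses 
only the counts given in the text; the data layout and variable names are 
illustrative choices, and the group size of 100 per stimulus word is taken from 
the description of the three equated groups. 

```python
from collections import Counter

# Response -> number of subjects giving it, by stimulus word (from the text).
responses = {
    "drink": {"Pepsi-Cola": 40, "Coca-Cola": 32, "Dixie-Cola": 11},
    "Coca-Cola": {"Dixie-Cola": 29, "Pepsi-Cola": 22},
    "Pepsi-Cola": {"Dixie-Cola": 29, "Coca-Cola": 10},
}

# Total frequency of each response regardless of stimulus.
totals = Counter(
    {response: sum(by_stimulus.values())
     for response, by_stimulus in responses.items()}
)
ranked = totals.most_common()

# Share of the Dixie-Cola group who answered with a better-known brand.
GROUP_SIZE = 100
dixie_competitor_share = (
    responses["Coca-Cola"]["Dixie-Cola"] + responses["Pepsi-Cola"]["Dixie-Cola"]
) / GROUP_SIZE

print(ranked)
print(dixie_competitor_share)
```

On these figures, more than half of the subjects given the least familiar name 
responded with one of the two better-known brand names. 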

Another approach is illustrated by a “mock readership survey” designed to 
test the effect of brand names upon consumers’ reactions to advertisements 
(27). A total of 207 men and 212 women were interviewed by the usual 
procedures followed in readership surveys. The only difference was that none 
of the six advertisements shown to the respondents had appeared in print any- 
where prior to the survey. One half of the subjects were shown a television ad 
with a well-known brand name (A), together with five ads of other products; 
the other half were shown the identical ads for the five other products and 
the same television ad, but with a much less popular brand name (B). The 
difference in popularity between the two brands is indicated by the fact that 
within the total sample about one-third of the respondents owned Brand A 
television sets, while less than 1 per cent owned sets of Brand B. 

In the readership part of the study, 61.8 per cent of the men and 63.7 per 
cent of the women said they had seen one or more of the ads. The controlled 
comparison of the two television ads yielded 23.2 per cent recognition for the 
A ad and 12.7 per cent recognition for the B ad among the men; the cor- 
responding percentages among the women were 22.4 and 11.8. Since the two 
groups of subjects who had seen the different versions of the television ad 
yielded no significant differences in their reported recognition of the remaining 
five ads, the differences between the results for Brand A and Brand B must 
be attributed to the effect of the brand names. 

After looking at all six ads, the respondents were questioned about their 
recall of the ads. The main point of the TV advertisement (i.e., “clearer 
channels") was recalled three times as often for Brand A as for Brand B among 
the men and twice as often among the women. Such findings suggest that ads 
for well-known and widely used brands are read more closely and recalled 
better than are ads for less familiar brands. Finally, when asked to rate a 
number of characteristics of the TV sets pictured in the ads, such as the design 
of the set, respondents rated the A set higher than the B set. 

Generic Trade Names. Sometimes a brand name becomes so strongly 
identified with a product that it comes to be used as a product name. Thus 
people may say Kleenex to mean any cleansing tissue or Jello to mean any 
gelatin dessert. Such a generic use of trade names has both legal and psycho- 
logical implications. Legally, a company may lose its exclusive right to the 
name if it can be demonstrated that the name is commonly employed to 
designate a product category. This was the fate of such brand names as aspirin, 
cellophane, nylon, and linoleum. Psychologically, the generic use of trade 
names may lead to the indiscriminate purchasing of competing brands. It may 
also cause increasing dissatisfaction with the original brand as a result of the 
unrecognized use of inferior brands by the consumer. 

An experiment conducted some twenty years ago with groups of college 
students demonstrated a simple technique for measuring the extent of generic 
use of trade names (25). After explaining the difference between generic 
product names and registered trade names, the experimenter presented the 
subjects with a list of names to be marked “trade name,” “generic name,” or 
“don't know.” Over two-thirds of the subjects incorrectly marked such trade 
names as Dictaphone, Mimeograph, and Thermos Bottle as generic product 
names. 

Today, many companies make special efforts in their advertising to guard 
their brand names against generic use. Some follow the precaution of employ- 
ing a product name along with the brand name, as in Ber de m m 
bandage, Prestone antifreeze, and Scotch brand filament tape. Capitalizing the 
name, enclosing it in quotation marks, and always printing it in the same 
type are other common safeguards. The idea may also be embodied in a slogan, 
as, for example, “If it isn't an Eastman, it isn't a Kodak.” An unusually 
thorough approach to the problem is illustrated by the continuing campaign 
conducted by Cluett, Peabody & Co. to educate consumers, retailers, and 
manufacturers regarding the meaning and proper use of their registered trade- 
marks, Sanforized and Sanforized-Plus (see Figure 79, Chapter 11). 

Trade-name Confusion. The legal problem of trade-name infringement 
arises from the psychological facts of verbal perception and discrimination. 
When reading a word, we do not normally perceive every letter in it. A few 
letters provide enough cues for the recognition of the whole word. Conse- 
quently, if certain letters are omitted, added, or substituted, the word may still 
be perceived in its original form. 

In an early experiment on trade-name confusion, subjects were shown sets 
of 20 typewritten trade names at the rate of 1 per second (36). Immediately 
after exposure of one of these sets, 40 trade names were presented, including 20 
new names, 10 from the original list, and 10 imitations of the remaining 
original names. The percentage of subjects who "recognized" an imitation as 
though it had appeared in the original list provided a direct measure of the 
extent of confusion between the two names. The 20 new names in the list 
yielded control data on the extent of "normal" confusion to be expected in 
such a recognition test. 
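
The confusion measure and its control can be expressed as two percentages. 
The Python sketch below is illustrative only: the subject counts are 
hypothetical, and subtracting the control rate from the imitation rate is an 
assumption about how the control data might be used, not a procedure stated in 
the study. 

```python
def percent_recognized(n_recognized: int, n_subjects: int) -> float:
    """Percentage of subjects who reported having seen a name before."""
    if n_subjects <= 0:
        raise ValueError("n_subjects must be positive")
    return 100 * n_recognized / n_subjects


def net_confusion(imitation_pct: float, control_pct: float) -> float:
    """Imitation confusion in excess of the rate found for wholly new names."""
    return imitation_pct - control_pct


# Hypothetical results for one imitation name and for the new (control) names.
imitation_pct = percent_recognized(28, 50)  # 28 of 50 "recognized" the imitation
control_pct = percent_recognized(5, 50)     # 5 of 50 "recognized" a new name
print(imitation_pct, control_pct, net_confusion(imitation_pct, control_pct))
```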

Since the trade names investigated in this study had been involved in in- 
fringement proceedings, the results of the recognition test could be compared 
with the court decisions. Of 16 imitations that had been declared to be in- 
fringements because of name similarity, only 7 were confused by 50 per cent or 
more of the subjects, some being confused by as few as 20 per cent. On the 
other hand, among the trade names declared to be noninfringements, some 
were confused by as many as 50 per cent of the cases. It thus appears that the 
legal decisions were often inconsistent with the experimental findings. Since 
the latter represent a direct measure of empirically obtained confusion, the 
validity of the legal decisions appears questionable in such cases. It might be 
added that this type of study illustrates one way in which psychological re- 
search may be utilized in the legal profession, an application to be discussed 
further in Chapter 20. 

A recent adaptation of the recognition method for measuring trade-name 
confusion is described by Weitz (60). In this method, four groups of respond- 
ents are examined by means of four different experimental procedures. In the 
first group, the original names appear on the initial list, while the imitations 
appear on the test list; in the second, the procedure is reversed. In the third, 
the originals appear on both lists; and in the fourth, the imitations appear on 
both. Several control measures can thus be obtained. 


SUBLIMINAL PERCEPTION 


In the fall of 1957 widespread public alarm was aroused by the announce- 
ment of an “experiment” conducted by a commercial firm in a New Jersey 
motion-picture theater (see 2, 24, 31, 35). During the showing of the regular 
film, the words “Eat Popcorn” and “Drink Coca-Cola” were flashed alternately 
on the screen every 5 seconds for 1/3,000th of a second. Because of the extreme- 
ly brief exposure, these stimuli were described as “subliminal,” that is, below 
the limen, or threshold, of perception. Nevertheless, the firm claimed that 
over the six weeks when this procedure was followed the sale of popcorn from 
the lobby refreshment stand rose 57.5 per cent and that of Coca-Cola rose 
18.1 per cent. The report of these results aroused widespread comment in the 
popular press. This type of advertising was variously described as “the super- 
soft sell,” “the invisible sell,” and “the little ad that wasn’t there.” Dismal 
pictures were painted of a nation of robots whose behavior could be con- 
trolled by suggestions of which they were not even aware. The protests and 
charges soon reached such proportions as to come before the United States 
Congress and the Federal Communications Commission. 

The specific claims made regarding the New Jersey motion-picture demon- 
stration could not be evaluated because of the refusal of the commercial or- 
ganization to reveal the necessary details of procedure and results. Without 
such knowledge it is impossible to determine whether the reported increase in 
sales resulted from the subliminal advertising or from other uncontrolled 
factors. We would need to know, for instance, whether the size and nature 
of the audience were in any way unusual during the experimental period. 
Because of weather conditions, time of year, or the particular films shown, 
the audience might be larger than usual or might include a larger proportion 
of teen-agers, who are more likely to buy popcorn and Coca-Cola. Similarly, 
any changes made in the display of merchandise or in the operation of the 
sales booth itself would need to be investigated. It would also help to have 
more direct information on the buying behavior of the audience. Was there 
an increase in purchases made on the way in, before exposure to the subliminal 
advertising? How many customers made their purchases during the film show- 
ing and how many on the way out? 

In the absence of adequate information on the above demonstration, we may 
turn to some recent well-controlled investigations. Designed to test specific 
claims about the subliminal control of behavior, these experiments have yielded 
largely negative results. In one study (9), two groups of students watched 
the same 30-minute instructional film. During the showing of this film, a slide 
was projected on the screen for .01 second at 10-second intervals. The slide 
used in the experimental group showed a spoon of rice with the words "Won- 
der Rice" printed below it; the slide used in the control group contained only 
four lines arranged in a meaningless way. 

At the conclusion of the film, all subjects were shown a picture of the spoon 
of rice without the name and were asked whether they had ever seen it in 
a rice advertisement. A few individuals in both groups said they had seen it, 
but the proportion did not differ significantly in the two groups—in fact, it 
was slightly higher in the control group. Regardless of their answer to the 
first question, all subjects were asked which of two brands (Monarch or 
Wonder) they believed was more likely to be associated with the picture. 
Both groups chose Monarch much oftener than Wonder, but the difference 
between the groups was insignificant. Thus the experimental subjects' sub- 
liminal exposure to the words “Wonder Rice” failed to increase the probability 
of their associating this brand name with the picture. 

In another experiment (6), the word "beef" was superimposed on a class- 
room film in flashes of 1/200th of a second every 7 seconds. The control 
group was shown only the film, with no additional stimuli. After viewing 
the film, the experimental and control groups showed no significant differences 
in their references to beef in sentence completion and word-association tests. 
Nor was there a significant difference in reported preferences for roast beef 
sandwiches, out of a given list of five common types of sandwiches. The ex- 
perimental subjects, however, rated themselves as significantly more hungry 
than did the control group, even when time of day and interval since last meal 
were equated. Three of the original 108 subjects were eliminated from the 
analysis of results because when questioned they said they had seen a word on 
the screen. But it is interesting to note that only one saw "beef"; the other two 
perceived the word as "beer." 

In still another experiment, subliminal perception was found to have no 
significant effect upon the subject's choice of "right" or "left" circles when 
required to guess which was correct in each trial (7). Unknown to the
subjects, the words "Choose right" and "Choose left" were exposed
tachistoscopically for durations of .01, .02, and .03 seconds in different
parts of the experiment. The only subjects whose individual performance was better than
chance later reported that they had seen the words and had followed the 
suggestion deliberately. In a learning experiment (59), performance in two 
different learning tasks was not significantly affected by the subliminal ex- 
posure of either correct or incorrect suggestions. In this experiment, the dura- 
tion of subliminal exposure was based on an individually determined threshold 
for each subject. 

Quite apart from its use in advertising and its potential dangers as a devious 
means of "behavior control," subliminal perception has been extensively studied 
in the psychological laboratory (see 1, 20, 31, 35). Research on subliminal 
perception dates back fully 100 years. Many experiments have demonstrated
that subjects do respond in various ways to stimuli of which they report no 
awareness. Stimuli may fall below the "awareness" threshold because of low 
intensity, brief duration, or other characteristics. To understand the subject's 
reaction to such stimuli, it is essential to consider the nature of a sensory
threshold. There is no single intensity such that stimuli above it are always
perceptible and those below it always imperceptible. Sensitivity varies along a
continuum; the transition from perceptible to imperceptible is gradual. 

The specific threshold established for a given subject depends in part upon 
the standard of accuracy. If the subject must give a correct response 75 per 
cent of the time, he needs a more intense stimulus than he would need to give 
50 per cent correct responses. The threshold also depends upon the nature of 
the response indicator. For instance, a stimulus may be too weak for the subject 
to report awareness of it. Nevertheless, if he is told to guess its location, he 
may be right significantly more often than by chance. Even weaker stimuli 
may lead to autonomic responses, such as a galvanic skin reaction to a word 
previously conditioned to an electric shock. Flashing such a word on a screen 
may evoke the conditioned galvanic response even when the word is too faint 
to be perceived in terms of other response indicators. 
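
The dependence of the threshold on the standard of accuracy can be illustrated with a short Python sketch. The logistic curve used here is an assumed form chosen for simplicity, and the midpoint and spread values are hypothetical; neither comes from the studies cited.

```python
import math

def detection_probability(intensity, midpoint=10.0, spread=2.0):
    """Probability of a correct report at a given stimulus intensity.

    A logistic curve is assumed purely for illustration.
    """
    return 1.0 / (1.0 + math.exp(-(intensity - midpoint) / spread))

def threshold_for(criterion, midpoint=10.0, spread=2.0):
    """Intensity at which the detection probability equals `criterion`."""
    return midpoint + spread * math.log(criterion / (1.0 - criterion))

for criterion in (0.50, 0.75):
    value = threshold_for(criterion)
    print(f"{criterion:.0%} correct -> threshold intensity {value:.2f}")
```

The same subject and the same curve yield a higher "threshold" under the 75 per cent criterion than under the 50 per cent criterion, since only the standard of accuracy has changed.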

The problem of so-called subliminal perception thus reduces to that of 
threshold differences found with different response indicators. Reported aware- 
ness is a relatively crude and unreliable indicator. Such an awareness thresh- 
old will vary with the kind of directions given to subjects, the number and 
nature of available response categories, the serial position of stimuli, and other
extraneous factors (20). This irrelevant variance tends to raise the threshold.
An analogy may help to clarify the relationship among thresholds. Suppose you
weigh yourself on an ordinary scale and find your weight to be 118 pounds. 
If you weigh yourself ten times during the day on such a scale, your weight 
will probably remain at 118 pounds. But repeated weighings on a highly sensi- 
tive laboratory scale will reveal systematic weight changes in the course of a 
12-hour period. The latter could be described as "subliminal weight changes," 
which can be detected only with a delicate indicator. 

By modifying the relative sensitivity of the two response indicators employed 
in a perception experiment, an investigator may artificially vary the extent of
"subliminal perception" he finds from zero to any desired amount (20, p. 381). 
It can thus be seen that subliminal perception is not a special phenomenon but 
a methodological artifact. It is not a mysterious technique for “beaming ideas 
directly into the mind" while bypassing the individual's conscious defenses. 
Research on so-called subliminal perception can be more accurately described 
as the study of diverse responses to weak stimuli. So far there is no evidence to 
suggest that weak stimuli exert more influence on behavior than do strong 
stimuli. In fact, the reverse is generally found to be true. 

With regard to the use of subliminal stimulation in advertising, it should 
also be noted that individuals vary in their sensory thresholds. Hence a word 
flashed on a motion-picture screen may fall well above the awareness threshold 
of some persons and may be clearly read by them; for others, it may be totally 
imperceptible in terms of all response indicators; and for still others it may fall 
in the desired intermediate, or subliminal, zone. These thresholds also vary 
within the same person from time to time. When weak stimuli are employed, 
moreover, the probability of misperception increases. Remember the students 
who saw “beer” instead of “beef.” The advertiser who hopefully flashes the 
words "Buy Tasty Tea" on a motion-picture screen may find that many sub- 
jects actually perceive "Burn Trashy Ties"—a suggestion they may feel 
strongly tempted to accept when opening Christmas packages.

If at some future time it should be adequately demonstrated that subliminal 
stimulation can seriously influence a person’s actions, then psychologists 
should look to their code of professional ethics to make sure that they will use 
these techniques for the benefit of the individual and not for ulterior gains. 
Under such circumstances, too, legal controls should be instituted against 
misuse of the techniques by unscrupulous persons. But, in our present state 
of knowledge, subliminal perception does not constitute a social threat. 


READABILITY 


The term "readability" is most commonly used to mean the difficulty level of 
reading matter. In this sense, the easier a passage, the more readable it is. 
Reading ease is also affected by legibility of type, layout, and other mechanical 
features of the printed page. Similarly, one's interest in the topic influences 
the ease of reading. The measurement of readability, however, has centered on 
reading ease as determined by linguistic form. The principal question has 
been, “How can the writer express what he is trying to say so as to simplify 
the reader’s task?”

Vocabulary. One factor that affects reading ease is the difficulty level of
individual words. Frequency of use can serve as an index of word
comprehension. Common words are more likely to be understood than rare words. A
standard source of information on frequency of word usage is The Teacher's
Word Book of 30,000 Words, compiled by Thorndike and Lorge (55). This
book gives an alphabetical list of the more common English words, together 
with their frequency per million. The frequency data are based on several 
word counts utilizing a wide variety of sources, such as newspapers, magazines, 
children’s books, textbooks for many subjects and at various academic levels, 
literary classics in prose and poetry, elementary school readers, business and 
private correspondence, mail-order catalogues, cookbooks, and United States 
postal regulations. 

Words occurring 100 or more times per million and those occurring between 
50 and 100 times per million are so designated. For words occurring between
1 and 49 times per million, specific frequencies are recorded. Supplementary 
lists of rare words, occurring less often than once per million, are also included. 
In addition, there is a list of the 500 and of the 1,000 most common words. 
The advertising copywriter can profitably check his word choice against these 
frequency lists. The use of rare words in an ad addressed to the general public 
will not only reduce comprehension and decrease readership but may also 
antagonize the reader. 
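
Checking copy against a frequency list is a mechanical task, as the following Python sketch suggests. The frequencies shown are hypothetical placeholders, not values from the Thorndike-Lorge list, and the cutoff of one occurrence per million is an assumption made for the example.

```python
import re

# Hypothetical frequencies per million words. A real check would use
# counts taken from a published word-frequency source.
FREQ_PER_MILLION = {
    "buy": 120, "fresh": 95, "tea": 60, "the": 1000,
    "best": 150, "salubrious": 0.5,
}

def rare_words(copy, freq=FREQ_PER_MILLION, cutoff=1.0):
    """Return words that are unlisted or rarer than `cutoff` per million."""
    words = re.findall(r"[a-z']+", copy.lower())
    flagged = []
    for word in words:
        # Unlisted words are treated as rare (frequency 0).
        if freq.get(word, 0.0) < cutoff and word not in flagged:
            flagged.append(word)
    return flagged

print(rare_words("Buy the best fresh tea, the salubrious tea"))
```

Words flagged in this way are candidates for replacement by more common synonyms when the ad is addressed to the general public.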

Readability Formulas. For more than forty years, psychologists and ed- 
ucators have been studying the effect of different linguistic factors upon read- 
ing ease. Those characteristics of reading matter found to correlate most highly 
with criteria of reading ease (such as comprehension, reading speed, or grade 
level of content) have been combined into readability formulas (see 8, ch. 3). 
These formulas are designed to estimate the difficulty level of a passage on the 
basis of such factors as word frequency, word length, number of abstract words, 
sentence length, sentence structure, and number of personal references. Since 
most of these factors are highly intercorrelated, the various formulas derived 
from them yield fairly similar results. 

One investigator who has done much to publicize readability research is 
Rudolf Flesch. In 1946, three years after the completion of a doctoral disserta- 
tion on readability, Flesch published a popular book entitled The Art of 
Plain Talk (13). This book was followed by a series of other writing guides 
which also provided revisions and simplifications of the Flesch readability 
formula (15, 17). In developing his original formula, Flesch used as his 
criterion a series of graded passages from the McCall-Crabbs Standard Test 
Lessons in Reading. These passages had been standardized on the basis of 
number of questions correctly answered by children of known reading ability. 
The criterion of reading ease was thus based on empirically determined com- 
prehension. 

The most widely used form of the Flesch formula (14; 15, pp. 213-216)
yields two measures, a "reading ease" score (RE) and a "human interest"
score (HI). Reading ease is based on average word length in syllables (wl)
and average sentence length in words (sl). These figures are inserted in the 
following formula: 


RE = 206.835 - .846wl - 1.015sl


The human interest score is derived from the percentage of “personal words” 
(pw), such as proper names and personal pronouns, and the percentage of 
"personal sentences" (ps), such as spoken sentences, questions and other
remarks directed to the reader, and exclamations. The formula used for this
score is as follows:


HI = 3.635pw + .314ps 


The unit to which all these measures are applied is a continuous 100-word
passage. For reliable results, several random 100-word samples from the given
selection are analyzed, and the counts are averaged. 
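
The two formulas and the averaging procedure can be sketched in Python as follows. Two points are assumptions of this sketch rather than statements from the text: wl is taken as the number of syllables per 100 words (the reading under which the weight of .846 produces scores in the intended range), and the sample counts are hypothetical. Counting syllables and personal words is left to the analyst.

```python
def reading_ease(wl, sl):
    """Flesch RE. wl: syllables per 100 words; sl: average words per sentence."""
    return 206.835 - 0.846 * wl - 1.015 * sl

def human_interest(pw, ps):
    """Flesch HI. pw: personal words per 100 words; ps: personal sentences per 100 sentences."""
    return 3.635 * pw + 0.314 * ps

def mean(values):
    return sum(values) / len(values)

# Hypothetical counts from three 100-word samples of one selection.
samples = [
    {"syllables": 150, "sentences": 5, "personal_words": 6, "personal_sentences": 1},
    {"syllables": 142, "sentences": 6, "personal_words": 4, "personal_sentences": 2},
    {"syllables": 158, "sentences": 4, "personal_words": 5, "personal_sentences": 0},
]

# Average the per-sample measures, then enter the averages in the formulas.
wl = mean([s["syllables"] for s in samples])
sl = mean([100 / s["sentences"] for s in samples])
pw = mean([s["personal_words"] for s in samples])
ps = mean([100 * s["personal_sentences"] / s["sentences"] for s in samples])

re_score = reading_ease(wl, sl)
hi_score = human_interest(pw, ps)
print(f"RE = {re_score:.1f}, HI = {hi_score:.1f}")
```

Averaging the per-sample measures before applying the formula is one reasonable reading of "the counts are averaged"; averaging the raw counts first would give slightly different figures for sentence length.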


TABLE 19 Interpretation of Flesch Reading Ease Score
(Data from Flesch, 15, pp. 149-150)


Description        RE        Estimated                  Typical
of Style           Score     Reading Grade              Magazine

Very easy          90-100    5th                        Comics
Easy               80-90     6th                        Pulp fiction
Fairly easy        70-80     7th                        Slick fiction
Standard           60-70     8th-9th                    Digests
Fairly difficult   50-60     High school (10th-12th)    Quality
Difficult          30-50     College (13th-16th)        Academic
Very difficult     0-30      College graduate           Scientific


In both of these formulas, the weights and constants were so chosen as to 
yield scores ranging from 0 to 100. The relative weights of the two factors
in each formula were derived from correlations with the criteria of reading
ease and human interest. Tables 19 and 20 provide an interpretation of these
two scores in terms of academic levels and types of magazines.
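
Interpreting a reading ease score amounts to a table lookup, sketched here in Python with the bands of Table 19. Because adjacent bands in the table share a boundary value (for example, 80-90 and 90-100), the sketch assumes that a boundary score belongs to the easier band; that convention is not stated in the source.

```python
# Bands from Table 19: (lower bound of RE score, style, estimated reading grade).
RE_BANDS = [
    (90, "Very easy", "5th"),
    (80, "Easy", "6th"),
    (70, "Fairly easy", "7th"),
    (60, "Standard", "8th-9th"),
    (50, "Fairly difficult", "High school (10th-12th)"),
    (30, "Difficult", "College (13th-16th)"),
    (0, "Very difficult", "College graduate"),
]

def describe_reading_ease(score):
    """Return (style, grade) for an RE score, clamping it to the 0-100 range."""
    clamped = max(0.0, min(100.0, score))
    for lower, style, grade in RE_BANDS:
        if clamped >= lower:
            return style, grade

print(describe_reading_ease(65))
```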


To facilitate the computation of the Flesch RE and HI scores, tables have 
been prepared in which the scores can be found by merely entering the ob- 
tained word and sentence figures (11). Further simplifications of the Flesch 
formulas have subsequently been proposed by Flesch himself (17, ch. 6 and 
pp. 163-174) and by others (10, 12). Separate formulas have also been
developed for measuring the level of abstraction of writing (16, 19). Although
useful for special purposes, all such indices are highly intercorrelated (8, ch. 3;
38). Abstract words, for example, tend to be longer than concrete words and
to have lower frequency of usage.


TABLE 20 Interpretation of Flesch Human Interest Score
(Data from Flesch, 15, p. 151)


Description           HI             Typical
of Style              Score          Magazine

Dramatic              60-100         Fiction
Highly interesting    [illegible]    [illegible]
Interesting           [illegible]    Digests
Mildly interesting    [illegible]    [illegible]
Dull                  [illegible]    Scientific

Readability and Readership. Of all the readability formulas, those de- 
veloped and popularized by Flesch have had the greatest impact upon journal- 
ists, advertising copywriters, and others interested in reaching mass audiences. 
Concern with the "Flesch index" of one's writing soon spread through these 
fields, with a resulting simplification of style (see 8, pp. 146-148). Empirical
tests of the effects of such simplifications, however, have been few. 

Split-run readership tests with easy and hard versions of the same articles
have been conducted in such varied media as a Middle Western farm paper 
(30, 33, 34), a campus newspaper (53), and a company paper sent to em- 
ployees (54). The readership gains reported for the simpler version range from 
zero (or a slight loss) to as high as 66 per cent. Among the factors determining
these results are the absolute difficulty level of the versions as well as the 
amount of difference between them. Intrinsic interest of the content for the 
readers may also partly erase differences in reading ease. Another difficulty
in interpreting results is that some of the versions differ in ways not measured
by the Flesch scores. For example, the pair of articles yielding the largest 
readership difference (66 per cent) also differed in organization, the easier 
version being more systematically and clearly organized (see 8, pp. 104-105). 
In another case (53), the more difficult version sounded stilted and unnatural. 
It contained inappropriate words and circumlocutions that seemed to have 
been deliberately inserted to increase reading difficulty. 

Another type of study has been concerned with the reading ease of existing 
material, rather than employing specially prepared versions. In an analysis 
of all major articles appearing in a single issue of the Saturday Evening Post, 
readership and degree of satisfaction were surveyed in a nationwide sample 
of 340 readers (22). An abstraction index computed for each article correlated 
-.79 with “finishing index,” or proportion of readers starting the article who
finished it. On the other hand, among those who finished an article, the pro- 
portion who rated it “excellent” correlated +.80 with the abstraction index. 

In another study (57), the readability of editorial and advertising copy was 
compared in two magazines. The Flesch formulas for reading ease and human 
interest were applied to a random selection of issues of Time and Newsweek 
for two five-year periods: 1936-1940 and 1945-1949. In both periods and both 
magazines, advertising copy was easier to read on the average than editorial 
copy. The difference seemed to be shrinking, however. Advertising copy had 
become more difficult from the prewar to the postwar period, while editorial 
copy had become easier. The human interest scores showed negligible
differences in the various comparisons. The types of comparisons made in this
study are of particular interest, since the editorial content of a medium tends
to “select” the audience. Hence an analysis of editorial content provides the
advertiser with a useful guide to the difficulty level and style appropriate for such
an audience. 

General Evaluation. Readability formulas have met with a varied re- 
ception. They have been hailed enthusiastically as guides to good writing and 
attacked vigorously as mechanistic devices to stifle creativity and debase
literature. After looking with horror at figures such as those reported in Tables 19
and 20, the critics have asked, “Should writers of science and literature try
to raise their ‘Flesch index’ by imitating the style of pulp magazines and
comic strips?” 

The truth lies somewhere between these extreme views. Readability re- 
search can improve writing, but its findings must be used with discretion. 
Good writing depends on other factors besides those included in readability 
formulas. Then, too, there is undoubtedly an optimum difficulty level for 
different kinds of materials, as there is for different audiences. Intrinsically 
simple material becomes stilted, pretentious, and obscure when expressed in 
long words and intricate sentences. On the other hand, literary or scientific 
writings may lose in aesthetic value or in intellectual precision when over- 
simplified. Unusual words and subtle figures of speech may be a source of 
artistic delight in poetry. And the technical terms of science, mathematics, 
and logic represent a way of simplifying and clarifying complex masses of in- 
formation that might otherwise prove unmanageable. 

Simplicity and clarity of expression cannot be achieved by routine pro- 
cedures. They are general goals of good writing that need to be pursued by 
whatever means are appropriate to the subject matter. Readability formulas
are appropriate when the object is to maximize the communication of simple 
messages. This is the purpose of advertising copy, cookbooks, instruction sheets, 
training manuals, government bulletins, and similar forms of writing designed 
for mass audiences. It is to these forms of writing that readability formulas are 
most directly applicable. 


SUMMARY


For effective communication, the advertising message should be so presented 
as to be noticed, understood, retained, and accepted. The success of an ad in 
attracting and holding attention depends in part upon such mechanical factors 
as size, position, color, and illustration. Among the major techniques now used 
to study the effects of these factors are readership surveys, analysis of written 
inquiries, and laboratory techniques. Dummy magazines have been used in
controlled experiments on readership. Factor analysis of ad variables has also been
carried out with a readership criterion. Laboratory procedures adapted for the 
measurement of attention factors in advertising include eye cameras,
tachistoscopes, and a specially developed instrument that permits control of [illegible],
illumination, and exposure time as well as the binocular presentation of
different ads.

Any feature that is repeated in a series of advertisements comes to serve as
a reduced cue for the product and the company. Such reduced cues include
not only registered trademarks and brand names, but also slogans, trade
characters, and other recurring aspects of ads such as color combination, printing
type, layout, writing style, and nature of illustration. In an advertising
campaign, repetition with variation provides continuity and familiarity without
monotony. Psychological research on trademarks and trade names has been
concerned with such questions as recognizability, retention, and associative
value; effect of brand names upon consumers’ reactions to ads; generic trade
names; and trade-name confusion.

Subliminal perception is the process of responding to stimuli that fall
below the threshold of verbally reported awareness. Essentially it refers to
threshold differences associated with different response indicators, a problem on
which extensive psychological research is available. Popular claims regarding 
the effectiveness of subliminal advertising in influencing buying behavior are 
as yet unverified. The findings of laboratory experiments suggest that sub- 
liminal stimulation cannot appreciably affect the behavior of mass audiences. 
Readability research is concerned with the difficulty level of reading matter, 
as determined by its linguistic form. Among the factors affecting reading ease 
is vocabulary level, which can be checked against available word frequency 
counts. Readability formulas try to predict reading ease by an analysis of such 
variables as word frequency, word length, number of abstract words, sentence 
length, sentence structure, and number of personal references. Special formulas 
have been developed to gauge "human interest," "abstraction," and other 
aspects of reading matter. The formulas developed by Rudolf Flesch have 
been the most influential in journalism, advertising, and other mass media. 


Diagnosis


The focus of clinical psychology is on the individual. Clinical psychologists
work with persons who have intellectual or personality disorders. Today 
clinical psychologists are functioning in a variety of settings, including not 
only hospitals and outpatient clinics, but also industry, schools, prisons, courts, 
government agencies, and the armed services. In industry, for example, the 
number of employees with personality disorders is large enough to constitute 
a serious problem (24, chs. 20 and 21; 49). Such emotional difficulties occur 
at all levels, from unskilled labor to top management, and range from the 
mild problems of essentially normal persons to full-blown psychoses. The 
effects of these conditions upon job performance may take many forms, in- 
cluding excessive absenteeism, increased accident rate, deterioration of output, 
and disruption of co-worker morale. 

The practice of clinical psychology utilizes findings and techniques from 
many branches of psychology, such as abnormal psychology, personality theory, 
and psychological testing. The specific activities of clinical psychologists consist 
chiefly of diagnosis, therapy, and research, although the emphasis placed upon 
each varies widely with the work setting and with the individual psychologist. 
Moreover, these three functions are frequently intertwined. Research may be 
specifically concerned with the validity of diagnostic tools or with the effective- 
ness of different types of therapy. Similarly, diagnostic and therapeutic pro- 
cedures are indistinguishable in certain types of clinical practice, such as 
psychoanalysis. It is only for convenience that in this book diagnosis, therapy, 
and research are discussed in separate chapters. 

Both clinical psychology (Chapters 13-15) and the closely related field of 
counseling psychology (Chapters 16-17) are marked by an abundance of un- 
resolved issues and active controversies. In part, this situation stems from 
complexity of subject matter and from the comparative youth of these fields. 
Another factor making for diversity of opinion, however, is that both theory 
and practice in these fields tend to be highly colored by human values. Dis- 
cussions of the objectives of counseling and psychotherapy, or definitions of 
“mental health,” for example, often touch closely upon one’s concept of a 
“good life.” Thoughtful clinical and counseling psychologists have faced these 
issues, as will be noted in several connections in these chapters. But unanimity 
has by no means been attained. While complacency should not characterize
any field of applied psychology, turbulence and struggle are especially
typical of the clinical and counseling areas. Although this introductory book
focuses primarily upon what is presently known, many problems remain to 
be solved. The fact that much can be found in the common ground, about 
which considerable agreement exists, should not lull the reader into assuming
an absence of controversy.


OBJECTIVES OF CLINICAL DIAGNOSIS


In the present context, the term "diagnosis" is used in its broadest sense, to 
cover all aspects of clinical evaluation or assessment. Essentially diagnosis is a
fact-finding process. Among the various objectives of the diagnostic process 
in clinical psychology, four can be clearly identified: (1) screening and classi- 
fication, (2) personality description, (3) prediction of outcome, and (4) at- 
tainment of insight by the client. 

Screening and Classification. To the layman, diagnosis generally means 
attaching labels to people. This is the traditional approach to diagnosis, which 
clinical psychology has inherited from nineteenth-century psychiatry. In psy- 
chiatric hospitals, a good deal of attention is still given to the categorizing of 
patients by various diagnostic labels. It is generally recognized, however, that 
such labels can serve only for purposes of preliminary screening and rough 
classification. 

A major diagnostic distinction is that between mental deficiency and
personality disorders. In terms of intelligence test performance, mental deficiency
usually corresponds to an IQ of 70 or less. Within this category, further
subdivisions are traditionally made into three levels: mild (IQ 50 to 70),
moderate (IQ 20 to 50), and severe (IQ below 20). A more functional
classification that is rapidly gaining acceptance recognizes those cases requiring
custodial or nursing care, those that are trainable by specialized individual 
procedures, and those that are teachable in special classes. Decisions regarding 
the disposition of individual cases, moreover, should be based on a com- 
prehensive study of the individual's present functioning level, his develop- 
mental history, his family background, his physical condition, and other 
pertinent data. 

Although psychological research on mental deficiency has lagged behind the
investigation of other disorders, it now constitutes a very active area of re- 
search. Some progress has already been made toward an understanding of the 
nature and causes of mental deficiency. Organic defects underlying certain 
forms of mental deficiency, such as phenylketonuria and mongolism, have 
been identified. Some suggestive research has been done on the role of cultural 
deprivation in the production of intellectual retardation. There has also been 
an upsurge of interest in the training and rehabilitation of mental defectives. 
Further information regarding psychological work with mental defectives can 
be found in several recent books covering different aspects of the problem 
(37, 42, 62, 66). 

Among personality disorders, psychoses are commonly distinguished from 
neuroses. Psychoses represent the more severe disorders, which usually require 
institutionalization. The largest single group of psychotics, accounting for about 
half of the chronic patients in state hospitals, are diagnosed as schizophrenic. 
Although including several forms with distinct behavior patterns, schizo- 
phrenia as a whole is characterized by extreme withdrawal, loss of contact 
with other persons, and emotional dulling. Also typical of schizophrenia are 
such symptoms as hallucinations, disorganization of thought processes, and
peculiar mannerisms of speech and gesture. In the catatonic form, the patient 
may remain motionless in bizarre postures for hours. In the paranoid form, he 
may have delusions in which he believes, for example, that he is being 
persecuted for some fanciful reason. 

Although less common than schizophrenia, manic-depressive psychoses
constitute another widely recognized category. These conditions are characterized
by periods of extreme depression, hopelessness, and slowing down of thought
and action and by periods of elation, overactivity, and flight of ideas, in which 
one thought leads to the next through superficial associations. The same pa- 
tient may exhibit both types of behavior at different times; or either type alone 
may alternate with periods of normality. Hallucinations and delusions may 
occur during both depressed and manic phases. 

Mention should also be made of organic psychoses, which have identifiable 
physical pathology resulting from such factors as drugs, alcohol, senile de- 
terioration, or brain injuries. By contrast, such disorders as schizophrenia and 
manic-depressive psychoses are sometimes described as “functional” or “psycho- 
genic,” since no physiological basis has been conclusively demonstrated for 
them. Many psychologists object to these terms, however, because they imply 
more knowledge of etiology than is currently available. Although extensive 
research on the causes of these disorders is in progress, their etiology still re- 
mains largely unknown. 

Neuroses (or psychoneuroses) are relatively mild personality disorders in 
which the individual remains in touch with reality and shows no gross dis- 
orders of thought or action. These conditions are generally considered to be 
psychogenic and amenable to treatment by psychological techniques. Unlike 
the psychotic, who may be blissfully unaware of reality, the neurotic typically 
feels miserable. Often his emotional problems interfere with interpersonal 
relations and lower his productive capacity in school or on the job. Because 
he is aware that something is wrong and is unhappy about his condition, the 
neurotic is more likely to seek and continue treatment than is the psychotic 
and he is more likely to cooperate actively in the treatment. 

One of the most characteristic neurotic reactions is anxiety. The individual 
experiences feelings of dread and apprehension out of all proportion to the 
situations that arouse them. The anxiety may also be generalized, or “free- 
floating,” rather than being associated with any specific stimulus. In some 
cases, traditionally classified as “anxiety hysteria,” minor environmental stresses 
produce attacks of acute anxiety accompanied by palpitation of the heart, 
profuse sweating, difficulty in breathing, and tremors.

Another traditional neurotic category is “conversion hysteria,” in which
the individual develops some physical ailment, such as paralysis of his right 
arm, that enables him to escape from an unbearable emotional situation. Other 
types exhibit “phobias,” or irrational fears of specific objects or situations, such 
as claustrophobia (fear of closed-in places) or acrophobia (fear of heights). 
Still another example is the "obsessive-compulsive" type, characterized by 
persistent thoughts of a useless or irrational nature or by a strong compulsion 
to perform certain meaningless acts, like repeated hand washing. A particularly 
dramatic form of neurotic behavior is that in which the individual dissociates 
himself from unpleasant life situations by forgetting who he is and other 
details of his life. In rare cases, such amnesia may last for several years, the 
individual assuming a new name, working at a different occupation, and
living his life apart from his earlier contacts. Also rare is the condition of
"multiple personality," in which the individual alternates between two or more
distinct personalities (see, e.g., 72).

These and other reactions have been traditionally grouped
into specific "syndromes," or associated groups of symptoms. Other
diagnostic categories, as well as fuller descriptions of both psychotic and
neurotic behavior, can be found in textbooks of psychiatry or abnormal
psychology (see, e.g., 8, 32). Most clinical psychologists, however, avoid the use
of specific diagnostic labels in evaluating individual cases. More and more it 
is being recognized that the syndromes of the classical diagnostic systems do 
not fit the cases encountered in practice. In fact, "textbook cases" that do fit 
the descriptions are hard to find. 

Some attempts have been made to develop new systems of classification, 
especially for psychotic behavior, by investigating the empirical concomitance 
of symptoms within individual cases. Several studies have employed the tech- 
niques of correlation and factor analysis for this purpose and have proposed 
sets of categories corresponding to the factors thus identified (14; 18, ch. 1; 
23; 27; 45; 82). Although these studies provide many suggestive findings, no 
single schema of classification has emerged that has met with general ac- 
ceptance. 
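
The starting point of such studies, the degree to which two symptoms occur
together across cases, can be illustrated with a minimal sketch in Python. The
symptom names and case records below are invented for illustration and are not
drawn from the studies cited; the phi coefficient is assumed here merely as one
familiar index of correlation for present-or-absent data.

```python
from math import sqrt

def phi(x, y):
    """Phi coefficient between two present (1) / absent (0) symptom records."""
    n11 = sum(1 for a, b in zip(x, y) if a == 1 and b == 1)  # both present
    n10 = sum(1 for a, b in zip(x, y) if a == 1 and b == 0)  # only first present
    n01 = sum(1 for a, b in zip(x, y) if a == 0 and b == 1)  # only second present
    n00 = sum(1 for a, b in zip(x, y) if a == 0 and b == 0)  # both absent
    denom = sqrt((n11 + n10) * (n01 + n00) * (n11 + n01) * (n10 + n00))
    # A symptom shown by every case or by none has no defined correlation;
    # zero is returned in that situation as a convention of this sketch.
    return 0.0 if denom == 0 else (n11 * n00 - n10 * n01) / denom

# Hypothetical records: each list holds one entry per case (eight cases).
symptoms = {
    "withdrawal":  [1, 1, 1, 0, 0, 0, 1, 0],
    "flat_affect": [1, 1, 0, 0, 0, 0, 1, 0],
    "agitation":   [0, 0, 0, 1, 1, 1, 0, 1],
}

names = list(symptoms)
for i, first in enumerate(names):
    for second in names[i + 1:]:
        r = phi(symptoms[first], symptoms[second])
        print(f"{first} vs {second}: phi = {r:.2f}")
```

A factor analysis would begin from the complete table of such coefficients;
the sketch stops at the correlations themselves.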

Part of the resistance to the adoption of a new system of classification un- 
doubtedly stems from the inconvenience of effecting a change-over in institu- 
tional records. From a different angle, however, many clinical psychologists 
reject all attempts to place individuals into diagnostic categories. They point 
out that, even when derived from empirically established correlation of symp- 
toms, these categories still represent only group trends. The combination of 
specific symptoms of maladjustment is unique to each individual. Then, too, 
behavior traits vary in degree from person to person. Any classification in terms 
of presence or absence of a given symptom is therefore unrealistic. 

Still another argument against the use of diagnostic categories is that psy- 
chological disorders are not "disease entities," like pneumonia or scarlet fever. 
To classify an individual as a catatonic schizophrenic tends to create the mis- 
leading impression that he is suffering from a specific psychological “disease.” 
In reality, however, he manifests a collection of loosely related disorders of 
behavior that may have complex causes and probably require different treat- 
ment in individual cases. Rather than being identified as symptoms of “mental 
illness,” such behavior disorders are coming to be recognized more and more as 
direct manifestations of the conflicts and problems of daily living. This point 
of view has been most clearly expressed by Szasz in a book entitled The Myth 
of Mental Illness (69).

Personality Description. A major objective of the clinical psychologist’s 
diagnostic function is to provide a personality description of the individual 
case. Such a description is unique for each person, covering the specific be- 
havioral difficulties the individual manifests as well as the antecedent circum- 
stances that led to their development. In contrast to traditional diagnostic
labels, such a personality description is a full and detailed report. Its object 
is to facilitate the understanding of what makes the individual behave as he 
does and to serve as a guide for planning specific therapeutic procedures. It 
also differs from the traditional diagnostic approach in its utilization of a 
normal rather than a pathological orientation. The patient's behavior is seen 
as an exaggeration or distortion of behavior mechanisms commonly used by 
normal persons in meeting the conflicts and frustrations of daily living. 

The terms, concepts, or constructs employed in such personality descriptions 
vary somewhat with the personality theory favored by the particular clinician, 
although considerable progress has been made in integrating these different 
points of view. Typically, behavior is described in terms of the adjustment
mechanisms that the individual utilizes to cope with his emotional problems,
resolve conflicts, and reduce anxieties and tensions. Detailed discussions of
these adjustment mechanisms can be found in books on the psychology of
adjustment, mental hygiene, or personality (e.g., 48, 64).

A few examples will suffice to show what is meant by an adjustment mech- 
anism. One way in which an individual may react to an unpleasant or con- 
flictual situation is by escape, as in excessive daydreaming. Another escape 
mechanism is illustrated by the shy, seclusive person, who thereby avoids ex- 
posure to possible embarrassment, failure, or abuse in social contacts. The ex- 
treme withdrawal of the schizophrenic may be recalled in this connection. 
A familiar defense mechanism is rationalization, whereby the individual tries 
to "explain away" the source of conflict. Thus the failing student may assure 
himself that "enjoying life" and "being a regular fellow" are more important 
than "being a grind" and earning good grades. A related mechanism is projec- 
tion, or the practice of attributing our shortcomings to someone else. The 
student who justifies his failure by proclaiming that the instructor did not 
teach the course properly typifies this kind of defensive behavior. In the 
psychotic, such rationalizations and projections may take the form of a highly 
organized delusion. The patient may thus become convinced that he is really 
a brilliant scientist who is being kept prisoner by his enemies, and possibly
"brainwashed" in the process.

All typical psychotic and neurotic reactions, including phobias, compulsions, 
obsessions, amnesias, psychogenic ailments, and so forth, can be reformulated 
in terms of adjustment mechanisms. Since they are applied to specific types of 
behavior rather than to the individual as a whole, adjustment mechanisms 
permit more descriptive flexibility than is possible with traditional diagnostic 
categories. Moreover, these mechanisms provide hypotheses for exploring the 
sources of the individual’s behavioral aberrations.

It should be noted that all these adjustment mechanisms may be effectively 
employed by normal persons in certain situations. As used by neurotics and 
psychotics, however, they represent ineffectual and superficial solutions for 
the individual's emotional problems. They fail to eliminate his tensions and 
anxieties, although they may temporarily reduce their intensity. Furthermore, 
the individual's irrational behavior is itself disabling and hence creates new 
difficulties of varying degrees of severity. The purpose of therapy is to replace 
these ineffective problem-solving devices with more effective solutions.

Prediction of Outcome. One of the objectives of clinical evaluation is 
prognosis, or prediction of outcome. Since outcome may be defined in many 
ways, the prediction problem covers several more or less related questions.
One question pertains to the probability of improvement, recovery, or
deterioration in the absence of therapy. Such information is useful, for example,
in matching control and experimental samples in research on the effects of
therapy. It also contributes to practical decisions regarding the disposition of
individual cases.

Another question concerns the individual's response to therapy. Thus if an
outpatient clinic has a long waiting list, cases can be selected on the basis of
their estimated chances of improving with therapy. If those who fail to return
after one or two visits and those who continue in therapy but derive
no benefit from it can be weeded out in advance, available therapeutic
resources can be more effectively utilized. In accepting clients for psychotherapy,
moreover, most clinical psychologists make an effort to screen out the type
of cases that do not respond well to this form of therapy. 

With institutionalized cases, a further question is that of posthospital ad- 
justment. Decisions regarding discharge depend upon this type of prediction. 
The criteria against which such predictions can be tested include rehospitaliza- 
tion within specified periods of time, as well as various indices of the in- 
dividual's level of functioning in the community. Among these indices may be 
mentioned objective records of educational or vocational achievement, ratings 
by family members and other associates, and the individual's own report of his 
feelings. 

It can be readily seen that the criteria of outcome vary widely. This variation 
complicates research on the effectiveness of different forms of therapy, a 
difficulty that will be examined further in Chapter 15. For the present purpose, 
it is apparent that the validity of any given predictor may differ from one type 
of outcome to another. Considerable research has been done on the prognostic 
use of both tests and case history data in the effort to provide a systematic basis 
for clinical prediction (see 15, 19, 44, 56, 81, 83). An example of a relatively 
promising test developed to predict response to psychotherapy is Barron's
Ego-Strength Scale (4). Consisting of those items from the Minnesota Multi- 
phasic Personality Inventory that differentiated between improved and un- 
improved psychoneurotic groups, this scale proved successful when cross- 
validated against improvement criteria in several new samples (4, 19). 
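
The general logic of empirical item selection followed by cross-validation
may be sketched as follows. The responses, the cutoff of .30, and the function
names are hypothetical and serve only to illustrate the procedure; they do not
reproduce the actual items or data of the scale.

```python
def select_items(improved, unimproved, min_diff=0.30):
    """Keep items whose endorsement rates differ between the two groups.

    Each case is a list of 0/1 item responses. Returns a scoring key that
    maps item index to the response given more often by the improved group.
    """
    key = {}
    for i in range(len(improved[0])):
        p_imp = sum(case[i] for case in improved) / len(improved)
        p_un = sum(case[i] for case in unimproved) / len(unimproved)
        if abs(p_imp - p_un) >= min_diff:
            key[i] = 1 if p_imp > p_un else 0
    return key

def scale_score(case, key):
    """Number of selected items answered in the keyed direction."""
    return sum(1 for i, keyed in key.items() if case[i] == keyed)

def mean(values):
    return sum(values) / len(values)

# Hypothetical derivation sample: the key is built on these cases only.
derive_improved = [[1, 0, 1, 1], [1, 0, 1, 0], [1, 1, 1, 1]]
derive_unimproved = [[0, 1, 1, 0], [0, 1, 1, 1], [0, 0, 1, 0]]
key = select_items(derive_improved, derive_unimproved)

# Hypothetical cross-validation sample: new cases not used in item selection.
new_improved = [[1, 0, 0, 1], [1, 1, 1, 1]]
new_unimproved = [[0, 1, 0, 0], [0, 0, 1, 1]]

print("items kept:", sorted(key))
print("mean score, improved:", mean([scale_score(c, key) for c in new_improved]))
print("mean score, unimproved:", mean([scale_score(c, key) for c in new_unimproved]))
```

The essential point is that the key is derived on one sample and checked on
another; a difference that holds up in the new sample is evidence that the
selected items are not merely capitalizing on chance.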

Analysis of case history material on schizophrenics has suggested a useful 
distinction between process and reactive schizophrenics (33, 39). One of the 
chief differentiating features of these two groups is found in the onset of the 
disorder, which is slow and gradual in the process group and sudden in the 
reactive group. In the latter cases, too, the psychotic condition is often pre- 
cipitated by a highly stressful experience. There is fairly extensive evidence 
to show that prognosis is more favorable for the reactive than for the process 
group. Other systematic analyses of premorbid data have yielded promising 
indices for forecasting schizophrenic adjustment both in the hospital and upon 
restoration to the community (56). 

A final example is provided by a survey of posthospital adjustment of dis- 
charged patients (15). In terms of both rehospitalization and community- 
adjustment criteria, this study showed that the environmental milieu to which 
the patient returns is a major factor in the success of his posthospital adjust- 
ment. For instance, married women with young children were more likely to 
make a good adjustment than were single women living with their parents 
or other relatives. Data regarding the condition of these cases at the time of 
admission and discharge indicated that this difference in adjustment could 
not be attributed to selective factors. The investigators suggest that a family 
milieu in which the patient is required to take responsibility, as in managing 
a home and caring for young children, is more conducive to recovery than one 
in which the "sick" role is accepted and even encouraged by solicitous relatives. 

The same study found that educational and socioeconomic variables are 
significantly related to outcome. Women who were college graduates, for in- 
stance, made a much more successful posthospital adjustment and were less 
often rehospitalized than women with only a grade school education. A 
similar relation was found between outcome and socioeconomic status. It 
should be noted that educational and socioeconomic factors may enter the
picture in at least two ways. They not only determine certain features of the
posthospital milieu to which the individual returns but also reflect charac- 
teristics of the patient himself—such as intellectual level, persistence, and self- 
confidence—that may be directly related to outcome. 

Attainment of Insight. An important goal of the fact-finding, or ex- 
ploratory, aspect of the clinical process is the attainment of insight by the 
client. Although any discussion or sharing of diagnostic findings with the 
client may contribute to such insight, certain clinical approaches are primarily 
oriented toward this objective. In nondirective, or client-centered, therapy, 
the principal goal is to encourage the client to verbalize his anxieties and 
examine his problems. Through such self-exploration, the individual comes 
gradually to understand his motives and actions. The insight he thus gains, 
it is believed, will enable him to deal with his own conflicts and to manage 
his behavior more effectively. 

Similarly, psychoanalysis is specifically designed to make the individual 
aware of the origins of his emotional difficulties, which are often traced back 
to early childhood experiences. Free association and dream analysis are among 
the psychoanalytic techniques used for this purpose. Other common clinical 
procedures, such as play techniques and psychodrama, are also employed, with 
the dual objective of furthering the clinician’s understanding of the client and 
the client’s understanding of himself.

The utilization of diagnostic techniques for the promotion of client insight 
illustrates the interrelation of diagnosis and therapy in clinical psychology. 
Insight plays an important part in the major forms of psychotherapy (see, e.g., 
28, ch. 4). The methods cited above for the attainment of insight represent 
well-known therapeutic procedures. As such, they will be considered more 
fully in Chapter 14. They are mentioned here simply to highlight the fact 
that, in much current clinical practice, diagnosis and therapy cannot be 
differentiated with regard to temporal sequence, methods, or objectives. Even 
a brief contact during which the client has an opportunity to describe his 
problem to a clinician may have therapeutic value. For that matter, when the 
client decides to consult a clinician and puts that decision into practice by 
keeping his first appointment, that action itself may contribute to his recovery. 

There is also a growing tendency among clinical psychologists to de-empha- 
size diagnosis, especially in the narrow sense of advance understanding by 
the clinician (see, e.g., 52). Many believe that such advance understanding of 
the client's problems does not speed up therapy. Some are even skeptical about 
the contribution of causal understanding in general to the treatment process. 
To be sure, diagnostic procedures of various sorts are still widely employed in 
clinical practice, and extensive research is being devoted to them
(see Chapter 15). Nevertheless, the gradual recognition that at least some of
these techniques are of doubtful validity has opened up the whole question of
whether they serve any useful function. 


SOURCES OF INFORMATION IN 
CLINICAL DIAGNOSIS 


In the effort to understand the client’s problems, clinical psychologists may 
turn to three major sources of data, namely, case history, psychological tests,
and diagnostic interview (see 22; 28, part III; 64, pp. 491-513). Depending
upon their theoretical orientation, the setting in which they function, and the
type of client with whom they are dealing, clinicians will vary in the extent 
to which they employ each of these sources. 

Case History. Utilizing a reverse longitudinal approach, the case history, 
or anamnesis, provides data about the client's reactional biography. It covers 
family background and current familial situation, educational and vocational 
history, and other important facts regarding the individual's physical and 
psychological development from the prenatal stage to the present. Information 
may be gathered through interviews with the client himself, members of his 
family, teachers, employers, co-workers, friends, or anyone else who has had 
contact with the client. School and employment files, agency reports, court 
records, and other official sources may be consulted. Unless an adequate 
medical report is already available, a medical examination is usually requested 
to round out the picture and to identify any conditions that may require treat- 
ment by a physician. In certain settings, the services of a social worker are 
utilized in obtaining at least part of the case history material. 

Psychological Tests. In clinical practice, psychological tests are commonly 
employed to determine general level of intellectual functioning and to check 
for evidence of intellectual impairment or deterioration. A wide variety of 
personality tests is also available for investigating emotional disorders, strength 
of various needs, interests, attitudes, adjustment mechanisms, self concepts, 
interpersonal relations, and many other facets of behavior. Major types of 
tests used in clinical practice will be examined in later sections of this chapter. 

Interviewing Techniques. The diagnostic interview represents a par- 
ticularly important part of the process of clinical assessment (6, ch. 13; 28, 
ch. 15). Although a single intensive interview may serve for this purpose, 
frequently a series of interviews is required. In a diagnostic interview, the 
client has the opportunity to describe his problem as he sees it and to talk 
freely and fully about himself. Interest generally centers on how the client 
perceives events in his life and how he reacts to them, rather than on the 
objective events as such. Clinicians differ, however, in the extent to which 
they ask questions designed to elicit information about specific behavior 
difficulties, sources of conflict, or symptoms. In the nondirective approach, for 
example, the client is simply encouraged to talk about anything he considers 
important. 

Although the interview is largely a medium for verbal communication, it 
also provides many opportunities for direct behavioral observations. Facial 
expressions, laughter, crying, blushing, muscular tics and grimaces, gestures, 
posture, and other expressive reactions serve as additional cues to the client’s 
feelings. In the more severely disturbed cases, such symptoms as hallucinations, 
delusions, and flight of ideas may become apparent in the course of the inter- 
view. 

For maximal effectiveness, the interview requires good rapport between 
clinician and client. Essentially this means that the client must have con- 
fidence in the clinician, so that he feels free to communicate. Ordinarily an 
individual shows some resistance to discussing or even thinking about certain 
highly emotional matters. There are several ways in which the clinician may 
help to overcome such resistance. Privacy and confidentiality of communication 
are of course essential. Through his own comments and behavior, moreover,
the clinician can indicate that he is interested, attentive, and seriously
concerned about the client's problems. The clinician's manner should be warm,
friendly, and relaxed, while at the same time remaining impersonal and pro- 
fessional. 

Most clinical psychologists today put the greatest emphasis on the establish- 
ment of a permissive atmosphere. By this is meant that the clinician refrains 
from judging or evaluating anything the client communicates—he expresses 
neither approval nor disapproval but concentrates on understanding and 
clarifying the client's own feelings. Although “permissiveness” is most closely 
associated with the client-centered school of clinical practice, its advantages— 
at least at certain stages—are quite generally recognized. In actual practice, 
however, it is difficult for any clinician to withhold entirely all expressions 
of approval or disapproval. Differential reinforcement of topics for discussion 
or of different types of patient responses is likely to be provided even by the 
most nondirective of clinicians. Some research findings on the effects of subtle 
cues in the clinician’s behavior upon the course of client responses will be 
considered in Chapter 15. 

It should also be noted that in much current clinical practice the diagnostic 
interview merges imperceptibly with the therapeutic interview. This is par- 
ticularly true of those approaches that stress the therapeutic value of the client- 
therapist relationship. 


NATURE OF CLINICAL JUDGMENT 


Two more or less related questions can be asked about the clinician’s diagnostic 
function: “What does he do?” and “How well does he do it?" A fair amount 
of research and considerable discussion have been devoted to both questions. 
Nevertheless, psychologists are still a long way from reaching satisfactory 
answers. The only matter on which complete agreement could be obtained 
today is that “the nature of clinical judgment” is a topic that will guarantee a 
lively symposium. 

Person Cognition. What the clinician does in assessing a client may be
regarded as a special case of person cognition, or interpersonal perception
(9, 20, 21, 63, 70). Through what process does anyone come to know and
understand another person? This is a broad question that has been approached
from the standpoints of epistemology, the psychology of perception, and
concept formation, among others. Investigations of the accuracy of interpersonal
perception and of the factors that affect it are beset with methodological pitfalls
that have only recently been recognized. Generalizations about one's ability
to judge others are misleading unless they specify the type of judgment and
the circumstances under which it is made. Moreover, spurious impressions of
accuracy can arise from failure to control certain variables, such as degree of
resemblance between judge and subject or degree of resemblance between
subject and a group stereotype in terms of which the judge may be responding
(20, 21).

In understanding another person, the individual often relies upon assumed
similarity to himself. He can thus use his own experience in interpreting the
behavior of another. Although he cannot directly observe another's aches and
pains or feelings of joy and sadness, he can identify them through facial
expressions, gestures, verbal reports, and other overt cues that he has learned to
associate with his own feelings. Assumed similarity between a newcomer and
familiar others, such as one's relatives, friends, or former clients, is an extension 
of this mechanism. When this approach is followed in trying to understand 
someone quite unlike oneself—or unlike one's earlier acquaintances—it is 
likely to be misleading. Errors may thus arise when clinicians make diagnostic 
or prognostic inferences about a client whose cultural background, education, or 
socioeconomic level differs markedly from their own (see, e.g., 75).

The Clinician's Data-gathering Function. The diagnostic role of the 
clinician can be described in terms of data gathering and data interpreting. We 
shall begin by considering what the clinician can contribute to the process of 
data gathering. First, by establishing and maintaining rapport, he may elicit 
from the client facts about his life history not readily accessible in other ways. 
Such life-history data provide a particularly sound basis for understanding an 
individual and predicting his future behavior (13, 43). The life history has 
been aptly described as "an unbiased population of events which is as con- 
vincing an operational definition of a person as one can hope for" (13, p. 21). 
It might be added that the more factual data one has from the life history, 
the less filling-in needs to be done in the interpretive process.

In his search for facts, the clinician can also be guided by the client's own 
responses. He may thus form tentative hypotheses that suggest the direction 
of further probing. Such interlocking of data gathering and interpretation 
represents both an advantage and a potential danger of clinical interviewing. 
On the one hand, it permits more flexibility of search and more effective 
utilization of cues than would be possible with a test, questionnaire, or other 
standardized procedure. On the other hand, if the clinician puts undue con- 
fidence in his early hypotheses, he may look only for data that support them. 
By the type of questions he asks and the way he formulates them or by 
subtle expressions of agreement or disagreement, he may influence what the 
client reports. Sarbin and his associates (63) call this process “soliciting” as 
contrasted to "probing." Such biased data-gathering techniques undoubtedly
account for the remarkably uniform etiologies found among the clients of some 
psychoanalysts. 

Another way in which the clinician contributes to the fact-finding process 
is by serving as a stimulus in an interpersonal situation (61, 63). In this regard, 
the clinical interview may be regarded as a situational test (see Chapter 3). 
It provides a sample of the client's interpersonal behavior, observed under 
more or less controlled conditions. 

The Clinician’s Interpretive Function. There has been much discussion 
of the interpretive, or data-processing, function of the clinician. Some have 
argued for a special process of “clinical intuition” that is qualitatively different 
from other forms of inference. Belief in such a process arises partly from the 
fact that clinicians are often unable to report the cues they employ in reach- 
ing a conclusion (7; 63, ch. 8). The mystery disappears, however, when we 
realize that many of our perceptions utilize cues that are inaccessible to self- 
examination. A classical example is visual depth perception. We regularly 
perceive the world in three dimensions, without being able to specify the 
cues we employ in this process. Similarly, after exposure to a test protocol, a 
set of test scores, a case history, or a face-to-face interaction with a client, the 
clinician may assert that the patient is creative, or a likely suicide, or a poor 
psychotherapy risk, even though he cannot verbalize the facts he used in
reaching such a conclusion. Being unaware of the cues that mediated his inference,
the clinician is also unaware of the probabilistic nature of the inference. As a
result, he may falsely associate a feeling of complete certainty with his "in- 
tuition." 

A number of investigations have been concerned with the comparative 
validity of clinical versus statistical predictions (see 25, ch. 10; 35; 47; 50; 51; 
63, ch. 16; 68). Given the same set of facts, such as test scores or life-history 
data, will clinical judgment provide more accurate predictions of subsequent 
behavior than would be obtained by the routine application of a regression 
equation or other empirically derived prediction formula? The large majority of 
studies have found statistical prediction to be at least as good as clinical 
prediction, and sometimes better. 
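
What is meant by the routine application of a regression equation can be
shown with a small sketch. The scores and criterion ratings are fictitious, and
a single predictor is assumed for simplicity; actual prediction formulas usually
combine several variables.

```python
def fit_line(xs, ys):
    """Least-squares slope and intercept for predicting y from x."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    sxx = sum((x - mean_x) ** 2 for x in xs)
    slope = sxy / sxx
    intercept = mean_y - slope * mean_x
    return slope, intercept

def predict(score, slope, intercept):
    """Apply the fitted equation mechanically to a new case."""
    return intercept + slope * score

# Fictitious follow-up data from earlier cases:
# a test score at intake and a later rating of adjustment.
test_scores = [10, 20, 30, 40, 50]
later_ratings = [12, 19, 33, 41, 50]

slope, intercept = fit_line(test_scores, later_ratings)
print(f"predicted rating = {intercept:.2f} + {slope:.2f} * score")
print(f"prediction for a new case scoring 35: {predict(35, slope, intercept):.1f}")
```

Once the equation has been fitted to earlier cases, it is applied to each new
case in the same way, with no further judgment entering the prediction. It is
this mechanical character that distinguishes statistical from clinical
prediction in the comparisons described above.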

It is difficult to generalize from such studies, however, because the results 
are undoubtedly affected by the nature of the data to be interpreted, the type 
of criterion behavior to be predicted, and the characteristics of the clinician. 
Not only the amount of training and experience of the clinician, but also the 
soundness of the theoretical framework within which he operates, will in- 
fluence the validity of his judgments. If a clinician is guided by a bizarre and 
unverified personality theory, his prediction success may be poorer than that 
of the crudest statistical procedures. The process of clinical judgment probably 
varies widely from one clinician to another. Insofar as the clinician functions 
as an instrument, he needs to be individually validated, just like a test. More- 
over, since clinical judgment is probably not a single trait, the clinician's 
validity must be separately investigated for different types of prediction, which 
have been classified according to type of client, type of outcome, and other 
relevant variables.

Several attempts have been made to analyze what clinicians do to arrive at 
their inferences (2, 16, 34, 36, 38, 46, 50, 63, 74). It is frequently claimed
that clinical judgment, as contrasted to statistical data-processing techniques, is 
peculiarly suited to understanding the uniqueness of individuals. In the course 
of his interactions with a client, the clinician is able to develop constructs, 
categories, or trait concepts that are specifically fitted to the individual case. 
He can also take into account patterns of traits or events. His interpretation 
of any single fact, such as a test score, can be made contingent upon other 
variables, such as another test score or some biographical facts.

Furthermore, in his interpretations the clinician is able to incorporate rare 
events, whose frequency is too low to permit their use in statistical prediction. 
Although any one such event occurs very infrequently, it may significantly
affect outcome in an individual case. Moreover, different "rare events" will be
encountered often enough to have a substantial effect upon the decisions
reached in large numbers of cases. In this connection, Meehl quotes the old
paradox that “an improbable event is one that hardly ever happens, but
nevertheless something improbable happens almost every day” (50, p. 25).
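
The arithmetic behind this paradox is easily sketched. Suppose, purely as an
assumption for illustration, that there are n different kinds of rare event,
each occurring independently in a given case with the same small probability p.
The chance that at least one of them occurs is then 1 - (1 - p)^n, which the
following lines compute.

```python
def prob_at_least_one(p, n):
    """Chance that at least one of n independent events, each with
    probability p, occurs (independence and equal p are assumptions)."""
    return 1 - (1 - p) ** n

# Each kind of event is assumed to occur in one case out of a thousand.
p = 0.001
for n in (1, 10, 100, 1000):
    print(f"{n:>4} kinds of rare event: {prob_at_least_one(p, n):.3f}")
```

With a thousand such possibilities, the chance that some one of them turns up
in a given case is roughly .63, although each taken singly remains a
one-in-a-thousand occurrence.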

In focusing upon the uniqueness of the individual case, we must not lose
sight of the fact that people also exhibit a great deal of similarity. Without
such similarity, the science of psychology could never have developed. In
fact, society and civilization would be impossible, and human relations would
be chaotic. In the clinical context, it is the pronounced similarity among
individuals that enables the clinician to utilize the technique of assumed
similarity, to refer to test norms, or to draw upon personality theories developed
through research on other persons. With regard to the clinician's consideration
of patterns and interrelations of facts, it should be noted that such analyses can 
also be incorporated in statistical prediction formulas (see, e.g., 34). Although 
for most purposes linear regression equations based on simple additive combina- 
tion of terms have proved sufficiently accurate, more complex combinations of 
variables can be handled by appropriate mathematical techniques. Such mathe- 
matical models have in fact been used in an effort to discover the type of 
model actually followed by different clinicians (34). 

With diagnostic procedures in their present stage of development, there 
is much that the clinician must do, because neither tests nor statistical pre- 
diction formulas are available to do the whole job. For many years to come, 
clinicians will probably continue to use a combination of clinical judgment 
and such statistical aids as norms, expectancy tables, and regression equations. 
Clinicians need to be constantly aware of the probabilistic nature of all their 
inferences, whether derived from clinical judgment or statistical procedures. 
And they should make continuing efforts to check the validity of their
predictions in terms of empirical follow-ups. Whether it will ever be practicable
—or even theoretically possible—to program all of the clinician’s diagnostic 
functions on a computer is still a moot point. 


INTELLIGENCE TESTING 


Almost every type of test may be used by a clinician in individual cases. Tests 
of academic achievement, for example, are often useful in clinical work with 
children. Vocational aptitude and interest tests may be required to help solve
a special problem with an adolescent or adult client. Certain types of tests, 
however, were designed chiefly as clinical instruments. It is with these tests 
that we shall be concerned in the remaining sections of this chapter. Only 
a few examples of tests in each category will be cited. For a fuller discussion 
the reader is referred to textbooks on psychological testing (e.g., 1, 10) and
to books on the practice of clinical psychology (e.g., 22; 28, chs. 16-20; 57). 
Apart from providing an overview of the sort of tests used by clinicians, the 
present survey will familiarize the reader with certain concepts relevant to 
the discussion of therapy and of clinical research to be given in the next 
two chapters. 

Stanford-Binet Intelligence Scale. Among the best-established clinical
instruments are intelligence tests. Although the concept of intelligence still
needs much theoretical clarification, traditional intelligence tests have proved 
their worth empirically for many practical purposes. Typically, such tests 
yield an objective index of the individual's general level of intellectual func- 
tioning. The test that has been used for this purpose longer than any other is 
the Stanford-Binet. Originally developed by Terman at Stanford University 
from an earlier test introduced by Binet in 1905, the Stanford-Binet was most 
recently revised in 1960 (71). This form of the test extends from the age of 
two years to the superior adult level, but it is used most commonly with chil- 
dren. Most of the items are oral, although some require the subject to read, 
write, draw, or carry out simple manipulative tasks. 

Like most intelligence tests designed for clinical use, the Stanford-Binet is 
an individual test. It can be administered to only one subject at a time and re-
quires a trained examiner. Although more time-consuming and much harder
to give and score than group tests, individual tests provide additional informa- 
tion for the clinician. Thus in such a situation it is possible to observe the 
subject's work methods, attitudes, and problem-solving techniques, as well as 
the finished product. Interpersonal reactions, emotional responses, and other 
incidental behavior can likewise be noted. Individual tests also provide a better 
opportunity for establishing rapport with each subject than do group tests. 

The Stanford-Binet yields a single score expressed as a deviation IQ with 
a mean of 100 and an SD of 16. A particular advantage of this test stems from
the wealth of interpretive data that has accumulated through many years of 
clinical use. Clinicians have thus had a chance to learn inductively what to 
expect from individuals at different Stanford-Binet IQ levels. 

The Wechsler Scales. The Wechsler Adult Intelligence Scale (WAIS), 
like its earlier version, the Wechsler-Bellevue, was specially developed as a 
clinical instrument for adult testing (77, 78). Standardized in 1955 on a 
representative nationwide adult sample, the WAIS provides separate norms 
for successive age levels between sixteen and sixty-four years. Supplementary 
norms were also established on a special old-age sample extending to seventy- 
five years and over, tested in a Middle Western city. 

The WAIS yields a total deviation IQ (M = 100, SD = 15), as well as 
separate Verbal and Performance IQs. The Verbal IQ is based on six orally 
administered tests utilizing verbal and numerical content; the Performance 
IQ is derived from five subtests using nonverbal symbols, pictures, geometric
forms, and cutout parts of objects to be assembled by the subject. A simple 
item used for demonstration purposes in one of the picture tests is reproduced 
in Figure 92. The subject's task in this test is to rearrange a set of cards in 
the proper order so as to tell a story. Figure 93 shows a subject taking another 
of the performance tests, in which designs are copied by arranging blocks with 
painted sides. 
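The arithmetic behind a deviation IQ can be sketched briefly. The scale constants below (mean 100, SD 16 for the Stanford-Binet; SD 15 for the WAIS) come from the text. The function name `deviation_iq` and the age-group norms are invented for illustration, and published tests report IQs through norm tables rather than through a formula of this kind.

```python
def deviation_iq(raw_score, norm_mean, norm_sd, iq_mean=100.0, iq_sd=15.0):
    """Express a raw score as a deviation IQ relative to an age-group norm.

    norm_mean and norm_sd describe the raw-score distribution of the
    subject's own age group (hypothetical values in the example below).
    """
    if norm_sd <= 0:
        raise ValueError("norm_sd must be positive")
    z = (raw_score - norm_mean) / norm_sd  # distance from the mean in SD units
    return iq_mean + iq_sd * z


# Hypothetical age-group norms: mean raw score 40, SD 8.
# A raw score of 56 lies 2 SDs above the mean on either scale.
wais_style = deviation_iq(56, norm_mean=40, norm_sd=8, iq_sd=15)
binet_style = deviation_iq(56, norm_mean=40, norm_sd=8, iq_sd=16)
print(wais_style, binet_style)
```

Because the two scales use different SDs, the same relative standing yields slightly different IQ values, which is one reason IQs from different tests are not directly interchangeable.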

The Wechsler Intelligence Scale for Children (WISC) was prepared as a 
downward extension of the original Wechsler-Bellevue (76). It, too, provides 
Verbal, Performance, and Full Scale IQs and is quite similar to the WAIS 
in form and content. Norms are available for every four-month interval be-
tween the ages of five and fifteen years.


Fig. 92 Demonstration item from the WAIS Picture Arrangement Test. 
(Reproduced by permission. Copyright 1955, The Psychological Corporation.) 
Fig. 93 A subject taking the WAIS Block Design Test. (Courtesy of
The Psychological Corporation.) 


Performance and Nonlanguage Scales. A number of less well-known 
tests have been developed for use with special groups, such as infants and 
preschool children (see 1, ch. 11). Others have been designed for testing per-
sons unable to take the usual type of test, such as the deaf, the orthopedically
handicapped, children with reading disabilities, illiterates, and foreigners with 
a language handicap. There are several nonlanguage group tests that require 
no reading; some can be administered without the use of language altogether, 
the instructions being presented by means of gestures, pantomime, and demon-
strations. For individual testing, performance tests are generally employed for
several of the special handicaps mentioned above. Since most nonlanguage and 
performance tests tap chiefly perceptual and spatial rather than verbal and 
numerical functions, they are also used in clinical practice to provide a better-
rounded picture of the individual’s intelligence. The Performance Scale of the
previously mentioned WAIS and WISC was included chiefly for this purpose. 

In a few tests, the attempt has been made to eliminate, not only language, 
but also knowledge and skills that are specific to a particular culture. An ex- 
ample of such an intelligence test designed for cross-cultural testing is the Pro- 
gressive Matrices (58), previously described and illustrated in Chapter 3. 
This test, it will be recalled, consists of 60 matrices, or designs, from each 
of which a piece has been removed. The subject must choose the missing piece 
from six or eight given inserts. Because it is a nonverbal test requiring the 
eduction of relations among abstract items, this test is particularly useful in 
clinical practice as a supplement to the usual verbal type of intelligence test. 
Still another application is in the testing of persons with severe motor handi- 
caps, as in cerebral palsy, with whom the Progressive Matrices have proved 
quite successful. When administering the test to such cases, the examiner 
points to each piece in turn, while the subject signifies assent or dissent 
through some convenient signal. 

Tests of Intellectual Impairment. Several types of tests have been de- 
signed as clinical instruments for assessing intellectual impairment. Such 
tests are used to detect intellectual deterioration resulting from brain damage 
(or “organicity”), psychotic disorders, senility, or other pathological conditions.
They are sometimes employed to help differentiate between mental deficiency 
and psychosis, as well as between mental defectives with or without brain 
damage. Many of these tests are only loosely standardized, being designed 
chiefly to give the clinician an opportunity to make qualitative observations. 
Others have relatively standardized procedure and empirical norms.

Available tests of intellectual impairment are based on the premise of a 
differential deficit in different functions. Among the functions found to be 
most sensitive to pathological processes are memory for newly learned material, 
perception of spatial relations, and abstraction, or concept formation. The in- 
dividual's performance on these tests is usually compared with his performance 
in a function that is relatively resistant to deterioration. Vocabulary test scores 
or scores on predominantly verbal intelligence tests are often used for the
latter purpose. An index of the individual's prior intellectual level, from which
he subsequently deteriorated, can also be obtained from his educational or
occupational status or from intelligence tests he may have taken in the past. 

The Wechsler scales are sometimes used to find a deterioration index, as well 
as to identify diagnostic signs of various psychotic and neurotic disorders (78). 
All these measures are based on inequalities of performance from one subtest 
to another. Although employed by some clinicians and recommended by the
test author, these applications of the Wechsler scales have many technical
weaknesses that make them of doubtful value (see 1, pp. 320-325).

An example of a special test for measuring intellectual impairment is the 
Benton Visual Retention Test (5). This test combines perception and memory 
by having the subject reproduce designs from memory. In the procedure 
usually followed, each of 10 cards containing simple geometric figures is ex-
posed briefly. Immediately upon its removal, the subject draws what he saw
on the card. 

Fig. 94 Objects employed in a sorting test for evaluating concept
formation. (Courtesy of The Psychological Corporation.)


Another well-known type of test utilizes the sorting of objects or forms to
investigate the subject's concept formation (see 1, pp. 329-336; 26; 57, vol. I,
ch. 3). Thus the subject may be given a set of circles, squares, and triangles
in different colors, with the instructions to place together those that belong 
together. Whether he groups the pieces according to form or color, he is then 
asked to sort them in another way. In this test, interest is focused on the sub- 
ject's ability to shift from one basis of classification to another. In the sorting 
test illustrated in Figure 94, common objects are employed in a variety of 
sorting situations. For example, the examiner selects an article and asks the 
subject to pick out the others that belong with it. The examiner also places 
objects in groups on some basis, such as color, shape, material, or use, and asks 
the subject to say why they belong together. The same object may be included 
in several different groupings, such as eating utensils, toys, and metal objects. 
Hence the subject is again required to shift from one attribute of an object to 
another in the reclassifications. 


PERSONALITY TESTING 


The term "personality test" is conventionally used to designate techniques for 
assessing the emotional, social, and motivational aspects of behavior. It should 
be noted that neither personality nor intelligence represents an entity. Each 
refers merely to certain characteristics of the individual's behavior. Moreover, 
everything the individual does is influenced by both intellectual and personality 
factors. How well the individual performs an intellectual task depends in part 
upon his motivation, interests, and emotional adjustment. Conversely, the
way in which he resolves an emotional difficulty and his reaction to psycho- 
therapy are affected by his intellectual level. Application of the previously 
discussed tests of intellectual impairment demonstrates the effects of severe 
emotional difficulties upon intellectual functioning. 

Although personality tests vary widely with respect to reliability, validity, 
and adequacy of norms, such tests as a whole are not so well constructed nor 
so successful as those in the intellectual area. For this reason, personality tests 
are employed chiefly as exploratory tools in clinical practice. In assessing 
personality traits, most clinicians rely heavily upon life-history and interview 
data. Of the many personality tests available to the clinician, we shall con- 
sider three major types: (1) self-report inventories, (2) projective techniques, 
and (3) measures of self concepts and personal constructs. 

Self-report Inventories. In a self-report inventory, the individual answers 
a series of written questions about his typical feelings, attitudes, and actions. 
This technique has been used in measuring such varied characteristics as voca- 
tional interests, race prejudice, interpersonal relations, and neuroticism. Stand- 
ardized inventories have been prepared for use with different age groups, from 
the elementary school to the adult level. Some are designed for essentially 
normal persons; others were developed within clinical settings to detect emo-
tional maladjustments.

Among clinically oriented inventories, the most widely known is un- 
doubtedly the Minnesota Multiphasic Personality Inventory, or MMPI (12, 
17, 29, 30, 31, 80). Originally developed “to assay those traits that are com- 
monly characteristic of disabling psychological abnormality,” the MMPI con- 
sists of 550 statements which the subject is to classify as True, False, or Cannot 
Say with regard to himself. In the individual form of this test, which is the 
form more commonly used in clinical practice, each statement is printed on a 
card, and the subject places the card in the appropriate category in a box. The 
items range widely in content, covering such areas as: health, psychosomatic
symptoms, neurological disorders, and motor disturbances; sexual, religious, 
political, and social attitudes; educational, occupational, family, and marital 
questions; and common neurotic or psychotic behavior disturbances, such as 
obsessive and compulsive states, delusions, hallucinations, ideas of reference, 
phobias, and the like. A few illustrative items are reproduced below: 


I do not tire quickly. 

I am worried about sex matters.

When I get bored I like to stir up some excitement. 
I believe I am being plotted against. 


In its regular administration, the MMPI provides scores on the following 
10 scales:


1. Hs: Hypochondriasis           6. Pa: Paranoia
2. D: Depression                 7. Pt: Psychasthenia
3. Hy: Hysteria                  8. Sc: Schizophrenia
4. Pd: Psychopathic deviate      9. Ma: Hypomania
5. Mf: Masculinity-femininity    0. Si: Social introversion


Eight of these scales consist of items that differentiated between patients with
specified clinical syndromes and a normal control group of approximately 700
persons. These scales were developed empirically by criterion keying of items,
the criterion being traditional psychiatric diagnosis. Items for the Masculinity- 
femininity (Mf) scale were selected in terms of frequency of responses by men 
and women. The Social introversion (Si) scale was added later, on the basis 
of research on normal subjects. A special feature of the MMPI is its utilization 
of four so-called "validity" scales (?, L, F, K). These scales are not concerned 
with validity in the technical sense but provide checks on carelessness, mis- 
understanding, malingering, and the operation of special response sets and 
test-taking attitudes. 

Scores on the MMPI scales are expressed as standard scores with a mean 
of 50 and an SD of 10. These standard scores are used in plotting profiles, as
illustrated in Figure 95. A score of 70 or higher—falling 2 SD's or more above
the mean—is generally taken as the cutoff point for identifying possible 
pathological deviations. The interpretation of MMPI profiles is much more 
complex than the labels originally assigned to the scales might suggest. The 
test manual and related publications now caution against literal interpretation 
of the clinical scales. For example, we cannot assume that a high score on the 
Schizophrenia scale indicates the presence of schizophrenia. Other psychotic 
groups show high elevation on this scale; and schizophrenics often score high 
on other scales. Moreover, such a score may occur with a normal person. It is
partly to prevent possible misinterpretations of scores on single scales that the
code numbers 0 to 9 have been substituted for the scale names in later publica-
tions on the MMPI.
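The standard-score arithmetic just described can be illustrated with a minimal sketch. The mean of 50, the SD of 10, and the cutoff of 70 are taken from the text. The function names and the normative values in the example are hypothetical, and the sketch ignores any score corrections that the test manual may prescribe.

```python
def t_score(raw, norm_mean, norm_sd):
    """Convert a raw scale score to a standard score with mean 50 and SD 10."""
    if norm_sd <= 0:
        raise ValueError("norm_sd must be positive")
    z = (raw - norm_mean) / norm_sd
    return 50.0 + 10.0 * z


def is_elevated(score, cutoff=70.0):
    """Flag a standard score at or above the conventional cutoff (2 SDs above the mean)."""
    return score >= cutoff


# Hypothetical normal-group statistics for one scale: mean raw score 12, SD 4.
high = t_score(20, norm_mean=12, norm_sd=4)      # 2 SDs above the mean
moderate = t_score(16, norm_mean=12, norm_sd=4)  # 1 SD above the mean
print(high, is_elevated(high))
print(moderate, is_elevated(moderate))
```

As the surrounding discussion stresses, an elevated score of this kind marks only a possible deviation to be followed up; it is not a diagnosis.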

Fig. 95 MMPI profiles of a normal adult and a psychotic. (Adapted from
Weider, 79, pp. 554, 563. Copyright 1953, The Ronald Press Company.)


The original clinical scales of the MMPI were based on a traditional psy-
chiatric classification which, though popular, rests upon a questionable theo-
retical foundation. As we saw in the first section of this chapter, the artificiality
of these categories has been a matter of concern in clinical psychology for a
long time. The fact that such categories prove unsatisfactory in actual practice
is now generally conceded. Another difficulty is that a high score on any one
MMPI scale may have different implications depending upon the individual's 
scores on other scales. It is the score pattern, or profile, rather than the scores 
on individual scales that must be considered. To facilitate the interpretation of 
score patterns, a system of numerical profile coding has been developed. In 
such codes, the sequence and arrangement of scale numbers show at a glance 
which are the high and which the low points in the individual's profile. 
For instance, the code 49-2 signifies high scores in scale 4 (Pd) and scale 9 
(Ma) in decreasing order of size and a low score in scale 2 (D). A mass of
interpretive clinical material has been gathered and published, showing the 
characteristics of persons who obtain each coded profile on the MMPI (see 12, 
17, 30, 31). 
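The idea of a numerical profile code can be conveyed by a deliberately simplified sketch. It is not the coding system used in the MMPI literature, whose rules are more detailed; it merely lists the high scales in decreasing order of score, then a dash, then the low scales. The function name, the low cutoff of 40, and the profile values are assumptions made for illustration; the high cutoff of 70 is borrowed from the text.

```python
def simple_profile_code(t_scores, high_cutoff=70, low_cutoff=40):
    """Build a simplified profile code from a dict of scale number -> standard score.

    High scales (score >= high_cutoff) are listed from highest to lowest,
    followed by a dash and the low scales (score <= low_cutoff), lowest first.
    """
    highs = sorted(
        (scale for scale, t in t_scores.items() if t >= high_cutoff),
        key=lambda scale: -t_scores[scale],
    )
    lows = sorted(
        (scale for scale, t in t_scores.items() if t <= low_cutoff),
        key=lambda scale: t_scores[scale],
    )
    return "".join(str(s) for s in highs) + "-" + "".join(str(s) for s in lows)


# Hypothetical profile: scales 4 (Pd) and 9 (Ma) elevated, scale 2 (D) low.
profile = {1: 55, 2: 38, 3: 52, 4: 82, 5: 50, 6: 58, 7: 54, 8: 60, 9: 75, 0: 48}
code = simple_profile_code(profile)
print(code)
```

With these invented scores the sketch reproduces the form of the code 49-2 cited above: scale 4 precedes scale 9 because its score is higher, and scale 2 follows the dash as the low point.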

The MMPI has also served as an item pool for the construction of about 
200 special scales, most of them developed by independent investigators to 
meet specific needs (see 12). Both normal and clinical populations have been 
employed in selecting items for these scales against a wide variety of criteria. 
Examples include scales assessing Ego Strength, Manifest Anxiety, Depend- 
ence, Dominance, Prejudice, and Social Status. 

Insofar as items are selected and their scoring weights determined by em- 
pirical criterion keying, they may be regarded as verbal stimuli to which the 
subject responds when he takes the inventory. These responses are interpreted 
in terms of their empirically established correlations with criterion data. Thus, 
if a person reports that he has been generally unsuccessful in his undertakings, 
the point at issue is not his actual degree of success but the fact that he gives 
such a response. This approach to self-report inventories is effectively charac- 
terized by Meehl in a statement about the MMPI which was quoted in 
Chapter 11 (p. 301) in a different connection.¹

Projective Techniques. Although projective techniques have been em- 
ployed for a variety of purposes—including personnel selection, executive 
evaluation, and motivation research—their principal application is in clinical 
psychology. Most of them require individual administration by a specially 
trained examiner. And they yield a mass of qualitative data along with (or in
place of) quantitative scores. It will be recalled from Chapter 3 that projective
techniques as a group are characterized by the use of a relatively unstructured
task, which allows considerable freedom in the subject’s response. The under- 
lying hypothesis is that, when test stimuli and instructions are vague and 
equivocal, the individual’s responses will reveal his own basic personality 
characteristics.

There is almost no limit to the kinds of tasks or situations that can serve 
as a basis for projective testing. Available techniques are many and varied. 
One of the earliest was the "free association test," in which the subject is given 
a list of disconnected words, to each of which he must respond with the first 
word that comes to mind. Another test employing verbal stimuli is the sentence 
completion test. Subjects are given the opening words of a series of sentences
and are required to write suitable endings. Frequently the instructions stress
the fact that the sentences should express the subject's own feelings. The
sentence stems are chosen so as to permit wide latitude in the responses.
Typical examples are: “I feel . . .”; “What annoys me . . .”; "My er.
. . .” The responses are evaluated or coded in various ways. Thus they might
be rated for degree of maladjustment indicated; or they might be checked for
the number of references to a particular theme, such as insecurity, hostility, or
parental conflict.


¹ Personality inventories . . . were described in Chapter 3. The use of . . .
interests is discussed in Chapters 3 and . . .


One of the best-known projective instruments, the Rorschach, utilizes ink-
blots as stimuli (59). Developed in 1921 by the Swiss psychiatrist Hermann
Rorschach, this test utilizes 10 cards, each containing a bilaterally symmetrical
but otherwise irregular blot, similar to that illustrated in Figure 96. Some of
the cards are in black, white, and gray; others are in color. As the subject is
shown each blot, he is asked to tell what he sees—what the blot could
represent. Different systems of administration vary in the details of procedure
and scoring, but all require a good deal of qualitative interpretation in trans-
lating the subject's responses into broad personality characteristics. Although con-
tent is considered to some extent, the most common scoring systems are based
predominantly on formal characteristics of the responses. For example, scores
are derived from such data as: number of associations elicited by the whole
blot and number elicited by details of the blot; number of times the subject
responds to form and to color; frequency with which he perceives movement,
as well as kind of movement perceived (human, animal, etc.); and the ways in
which the subject perceives shading.


Fig. 96 An inkblot of the type employed in the Rorschach technique.

Several projective techniques employ pictures as stimuli. In the Thematic 
Apperception Test (TAT), which was cited in Chapter 3, black and white
pictures are presented individually with the instructions to make up a story 
to fit each picture (53). The subject is told to describe what is happening 
in the picture, to indicate what the characters are feeling and thinking, and 
to give the outcome. The content of the pictures is sufficiently unstructured 
to permit wide freedom of interpretation. A typical TAT picture is repro- 
duced in Figure 97. Evaluation of responses is largely qualitative, drawing 
heavily upon Murray's personality theory. 

A different use of pictures is illustrated by the Rosenzweig Picture- 
Frustration Study (P-F), based on the author's theory of frustration and
aggression (60). The test booklet contains a series of cartoon-like drawings, 
each depicting a frustrating situation commonly met in daily life. One of the 
characters in each cartoon is saying something that either causes the frustra-
tion or calls attention to it. The reply of the frustrated character is to be
filled in by the subject. One item from this test is shown in Figure 98.
Responses are analyzed in various ways. For example, direction of aggression
is classified as “extrapunitive,” or turned outward upon the environment;
“intropunitive,” or turned inward upon the subject; and "impunitive," or
turned off in an attempt at glossing over or evading the situation.


Fig. 97 One of the pictures used in the Thematic Apperception Test.
(Reprinted by permission of the publishers from Henry Alexander Murray,
Thematic Apperception Test, Cambridge, Mass.: Harvard University
Press, Copyright, 1943, by The President and Fellows of Harvard College.)

With regard to the contributions they can make to clinical assessment, 
projective techniques differ widely among themselves. Some appear more
promising than others because of sounder theoretical orientation, more favor-
able empirical findings, or both. About some there is a voluminous research
literature; about others little is known. Although each instrument obviously
must be evaluated on its own merits, certain general statements can be made
about the projective approach to personality assessment. 

On the positive side, projective techniques are effective in "breaking the 
ice" and establishing rapport during initial contacts with a client. They are 
also applicable to a wide variety of subjects. Several available techniques 
involving nonverbal media can be used successfully with young children, 
illiterates, or persons with language handicaps. Projective tests, moreover, are
less susceptible to faking than are self-report inventories, since their purpose
is generally disguised. In addition, the subject soon becomes engrossed in the
task and is hence less likely to resort to the habitual restraints and defenses of
interpersonal communication. At the same time, because of its unstructured
nature, the task is usually less threatening than those encountered in intelli-
gence and aptitude tests.


Fig. 98 Typical item from Rosenzweig Picture-Frustration Study.
(Reproduced by permission of Saul Rosenzweig.) The speaker in the cartoon
says: “I'm very sorry we splashed your clothing just now though we tried
hard to avoid the puddle.”

When evaluated in terms of their technical qualities as tests, however, 
projective instruments generally fare rather poorly. Most are inadequately 
standardized with regard to both administration and scoring procedures. Con- 
sequently, results are likely to vary as a function of the examiner. With a 
few notable exceptions, norms are inadequate. Normative data either are lack- 
ing or are based on small and unrepresentative samples. Frequently, not 
enough information is given about normative samples to permit an evaluation 
of the norms. For many projective tests, reliability is difficult to determine
because of the nature of the test and the absence of parallel forms. When ob- 
tained, reliability coefficients are often low. Empirical studies of validity have 
also yielded disappointing results for most projective tests. Positive findings 
have sometimes been obtained because of inadequate controls and other 
methodological deficiencies.

Some of the special problems encountered in the evaluation of projective 
techniques will be considered in Chapter 15, in connection with clinical
research. In their present state of development, projective techniques should
be regarded as interviewing aids rather than as psychometric tools. Their
chief usefulness lies in providing leads to be followed up by the clinician. 

Self Concepts and Personal Constructs. Although self-report inventories 
and projective techniques represent the most numerous and best-known devices 
for personality assessment, many other approaches are being actively pursued 
(see 1, ch. 21). Investigators have experimented with tests of perception, 
controlled association, aesthetic preference, and humor responses, among 
others. Of particular interest for clinical psychology are techniques for study-
ing self concepts and personal constructs. These techniques are closely allied
with some of the personality theories and systems of psychotherapy to be
discussed in the next chapter. A common feature of such techniques is their
concern with the way in which the individual views himself and others. In
this respect, they reflect the influence of phenomenological psychology, which
focuses on how events are perceived by the individual (41, 65). The indi-
vidual's self-description thus acquires primary importance in its own right,
rather than being regarded as a poor substitute for other types of behavioral
observations. Interest also centers on the extent of self-acceptance shown by
the individual.

One of the special procedures for investigating self concepts is the Q sort,
developed by Stephenson (67). In this technique, the subject is given a set of
cards containing statements or trait names and is asked to sort them into piles
according to their applicability to himself, from "most characteristic" to “least
characteristic.” The items may come from a standard list but more often are
designed to fit the individual case. To ensure a uniform distribution of ratings,
a forced-normal distribution may be used, the subject being instructed to place
a specified number of cards in each pile (see Chapter 4).

Q sorts have been employed to study a variety of psychological problems.
In the clinical investigation of individual personality, the subject is often asked
to re-sort the same set of items according to different frames of reference. For
example, he may sort the items as they apply to himself and to other persons, 
such as his father, mother, wife, or therapist. Q sorts can also be obtained for 
the individual as he actually thinks he is (real self) and as he would like to be 
(ideal self). A cumulative measure of the differences between real- and ideal-
self sorts provides an index of self-acceptance. To observe change, Q sorts may
be obtained repeatedly at different stages in the course of psychotherapy. 
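One way such a cumulative measure might be computed is sketched below. The choice of a sum of squared differences in pile placement is an assumption, since the text does not specify the formula; the function name, the five items, and the pile numbers (1 = least characteristic, 5 = most characteristic) are likewise invented for illustration.

```python
def q_sort_discrepancy(real_sort, ideal_sort):
    """Sum of squared differences in pile placement between two Q sorts.

    Each argument maps an item to the pile in which the subject placed it.
    Both sorts must cover the same items.
    """
    if real_sort.keys() != ideal_sort.keys():
        raise ValueError("both sorts must contain the same items")
    return sum((real_sort[item] - ideal_sort[item]) ** 2 for item in real_sort)


# Hypothetical five-item sorts, one card per pile.
ideal = {"confident": 5, "anxious": 1, "friendly": 4, "hostile": 2, "ambitious": 3}
real_early = {"confident": 2, "anxious": 5, "friendly": 4, "hostile": 1, "ambitious": 3}
real_later = {"confident": 4, "anxious": 2, "friendly": 5, "hostile": 1, "ambitious": 3}

early_gap = q_sort_discrepancy(real_early, ideal)
later_gap = q_sort_discrepancy(real_later, ideal)
print(early_gap, later_gap)
```

On the usual reading, a smaller discrepancy between real-self and ideal-self sorts suggests greater self-acceptance, so a drop from the earlier to the later value would be taken as a change in that direction over the course of therapy.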

Q technique represents an attempt to systematize self-rating procedures. It 
can also be employed as a basis for rating others. For example, a clinician 
may record his evaluation of an individual by means of a Q sort. Although 
some of the statistical procedures that have been employed in analyzing Q-sort 
data are questionable (11), the Q sort provides a useful rating technique for 
both clinical practice and research. 

Another relevant technique is the Semantic Differential, originally developed 
by Osgood and his associates (54) as a tool for research on the psychology of 
meaning. This technique has subsequently proved useful in a variety of fields,
including that of personality assessment. The Semantic Differential represents 
a standardized and quantified procedure for measuring the connotations of any 
given concept for the individual. Each concept is rated on a seven-point bipolar 
graphic scale defined by a pair of opposites such as good-bad, strong-weak, or 
fast-slow. Every concept to be investigated is rated in turn on each scale, as 
illustrated below: 


CHILD     large     ___:___:___:___:___:___:___  small
HATRED    tense     ___:___:___:___:___:___:___  relaxed
SEX       strong    ___:___:___:___:___:___:___  weak
CHILD     valuable  ___:___:___:___:___:___:___  worthless


Some of the scales can be applied literally to a particular concept, as in the 
rating of CHILD on the large-small scale. Many of the scales are obviously 
interpreted figuratively, as when a person is described as “cold.” When a scale 
appears totally inapplicable to a concept, the subject would presumably check 
the middle position. 

Intercorrelations and factorial analyses of different scales have revealed 
three major factors: Evaluative, with high loadings in such scales as good-bad, 
valuable-worthless, and clean-dirty; Potency, found in such scales as strong-
weak, large-small, and heavy-light; and Activity, identified in such scales as
active-passive, fast-slow, and sharp-dull. The evaluative factor is the most 
conspicuous, accounting for the largest percentage of total variance. Responses 
on the Semantic Differential can be analyzed in several ways. For example, the 
connotations of all concepts rated by an individual can be investigated by 
computing the “score” of each concept in the three principal factors described 
above. The factor loadings can be more easily visualized by means of three- 
dimensional models, as illustrated in Figure 99. These models are taken from 
an intensive study of a case of multiple personality, in which the patient 
shifted back and forth between sharply contrasted “selves” (72, 73). The one 
self, designated “Eve White,” was meek, self-critical, frustrated, and unhappy.
The other, “Eve Black,” was irresponsible, self-centered, fun-loving, and
mischievous. "Jane," a third, more highly integrated and better-adjusted per-
sonality, emerged in the course of therapy and seemed eventually to be
replacing the other two.


Fig. 99 Semantic Differential patterns found in a case of multiple
personality. Panels: Eve White; Eve Black. (From Osgood and Luria, 55, pp. 584, 585.)


An independent analysis of the Semantic Differential patterns of Eve White 
and Eve Black led to personality descriptions that agreed remarkably well with 
the case reports of the therapists (54, 55). Figure 99 shows these patterns as 
they appeared early in therapy. In these diagrams, “good” is at the top, "active" 
at the left, and “weak” toward the reader. The dark circle is the zero point for 
all three dimensions. The dark line connecting this circle to Doctor (who
retained approximately the same position in all diagrams) serves to orient the 
patterns in "semantic space." A number of contrasts are apparent between the 
two patterns. Eve White places Me much lower in the evaluative dimension 
than does Eve Black. The latter also shows less differentiation of concepts 
along the evaluative (up-down) dimension. Eve Black’s distorted values are 
further indicated by her placing Hate and Fraud along with Peace, Father, 
Doctor, and herself toward the good end of the scale, while Child, Spouse, 
Job, Sex, and Love are classified with Confusion and Sickness toward the bad 
end. In Eve White's pattern, the separation of Love and Sex and the neutral 
position of Spouse reflect some of her adjustment difficulties in marriage. 
Changes occurring in the course of therapy were accompanied by shifts in the 
position of concepts in later Semantic Differential patterns. 
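
The computation of a concept's "score" on the three factors can be sketched as follows. This is only an illustrative sketch: the seven-step coding (7 for the first-named pole, 4 for the middle position), the use of a simple unweighted average, and the function name factor_scores are all assumptions made here, and the ratings are hypothetical rather than data from the case described above.

```python
from statistics import mean

# Scale-to-factor assignment follows the scales named in the text.
FACTOR_SCALES = {
    "evaluative": ["good-bad", "valuable-worthless", "clean-dirty"],
    "potency": ["strong-weak", "large-small", "heavy-light"],
    "activity": ["active-passive", "fast-slow", "sharp-dull"],
}


def factor_scores(ratings):
    """Average one concept's ratings over the scales assigned to each factor.

    ratings maps scale name -> checked position, 1 to 7 (assumed coding:
    7 = first-named pole, 4 = middle position).
    """
    return {
        factor: mean(ratings[scale] for scale in scales)
        for factor, scales in FACTOR_SCALES.items()
    }


# Hypothetical ratings of a single concept by a single subject.
example = {
    "good-bad": 6, "valuable-worthless": 7, "clean-dirty": 5,
    "strong-weak": 2, "large-small": 1, "heavy-light": 3,
    "active-passive": 4, "fast-slow": 4, "sharp-dull": 4,
}
scores = factor_scores(example)
```

The three resulting values locate the concept as one point in the three-dimensional "semantic space"; repeating the computation for every concept rated yields a pattern of the kind plotted in the models.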

A technique devised specifically as an aid in clinical practice is the Role 
Construct Repertory Test (Rep test) developed by George A. Kelly (41). This 
test has certain features in common with several other personality tests, notably 
the Semantic Differential and the various sorting tests used to study concept
formation. In the Rep test, however, the objects to be sorted are persons who
are important in the subject's life. And, unlike the Semantic Differential, the 
Rep test requires the subject himself to designate the categories or constructs 
in terms of which he classifies these persons. The Rep test was developed 
within the context of Kelly's personality theory. A basic proposition of this 
theory is that the concepts or constructs an individual uses to perceive objects 
or events influence his behavior. In the course of psychotherapy, it is frequently 
necessary to build new constructs and to discard some old constructs before 
progress can be made. 

The Rep test is designed to help the clinician identify some of the client's 
important constructs about people. Although the test can be administered in 
many ways, one of its simpler variants will serve to illustrate its essential 
characteristics. In this variant, the subject is first given a Role Title List and 
asked to name a person in his experience who fits each role title. A few 
examples are: a teacher you liked; your father; your wife or present girl 
friend; a person with whom you have been closely associated recently who 
appears to dislike you. The examiner next selects three of the persons named 
and asks, "In what important way are two of them alike but different from the 
third?" This procedure is repeated with many other sets of three names, in 
which some of the same names may occur in different combinations. The Rep 
test yields a wealth of qualitative data. A simplified factor-analytic procedure
has also been developed for the quantitative identification of constructs that 
are important for each individual. 
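
The way in which the same names recur in different sets of three can be sketched as follows. The sketch simply enumerates every possible triad, which is an assumption made for illustration; in practice the examiner selects the sets to be presented. The function name triads and the four role labels are likewise hypothetical.

```python
from itertools import combinations


def triads(persons):
    """Return every set of three persons that could be presented together."""
    return list(combinations(persons, 3))


# Hypothetical labels standing in for the persons a subject has named.
named = ["liked teacher", "father", "wife", "person who dislikes you"]
sets_of_three = triads(named)

# Count how often each person recurs across the sets.
appearances = {
    person: sum(person in triad for triad in sets_of_three)
    for person in named
}
```

With four persons there are four possible triads, and each person appears in three of them. For each triad the examiner would record the construct the subject offers for the two persons judged alike and the contrast offered for the third.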

An example of the many variations that can be introduced into the Rep test 
is provided by a study of adolescent adjustment (40). In the adaptation 
developed in this study, the subject uses each of the constructs that emerge 
from the first part of the test to rate himself as he is at the time, as he was five 
years ago, and as he would like to be five years hence. He also estimates how 
each person named earlier in the test would rate him on the same constructs. 
Several of the measures obtained by this procedure differentiated significantly 
among a group of adolescents rated as well adjusted by their teachers, a group 
rated as poorly adjusted, and a group referred to a clinic for emotional problems. 


SUMMARY 


While functioning in a variety of settings, clinical psychologists are typically 
concerned with individuals who are experiencing difficulties of an emotional 
or intellectual nature. The principal activities of clinical psychologists can be 
classified as diagnosis, therapy, and research. These activities are frequently 
intertwined, their relative importance varying widely with the clinician. 

In its broadest sense, diagnosis corresponds to the fact-finding aspect of 
clinical practice. Its objectives include screening and classification, personality 
description, prediction of outcome, and attainment of client insight. In the 
classification of psychological disorders, a basic distinction is that between 
mental deficiency and emotional problems, the latter being further subdivided 
into psychoses and neuroses. More specific syndromes have been identified, but 
the application of these traditional psychiatric categories has many drawbacks. 
Characteristically, the clinical psychologist is interested in a detailed personality
description of the individual case, with its unique combination of problems,
adjustment mechanisms, and antecedent circumstances. The predictive
validity of both tests and life-history data has been investigated against such
criteria as reaction to therapy, postinstitutional adjustment, and improvement
or deterioration in the absence of therapy. The interrelation of diagnosis and
therapy is illustrated by those forms of psychotherapy that emphasize the 
therapeutic value of the client's insight into the nature and origin of his 
problems. 

The major sources of data available to the clinician include case history, 
psychological tests, and diagnostic interview. Considerable discussion and some 
research have been devoted to the nature and effectiveness of clinical judgment. 
The clinician’s diagnostic task can be regarded as a special case of the general 
process of person cognition. With regard to both data gathering and interpreta- 
tion, there are a number of ways in which the services of a skilled clinician are 
needed. Whether objective diagnostic techniques and statistical prediction for- 
mulas can eventually assume the entire diagnostic function remains a debatable 
question. 

Currently, clinicians utilize a number of psychological tests in conjunction 
with life-history and interview data. Among the tests most frequently used for 
clinical purposes are individual intelligence tests, special tests of intellectual 
impairment, personality inventories, projective tests, and various measures of 
self concepts and personal constructs.


Therapy


Many kinds of therapy are used today in treating emotional disorders. The 
choice of therapy depends partly upon the nature and severity of the disorder 
and upon such client characteristics as age and intellectual level. To a con- 
siderable extent, however, the type of therapy employed reflects the training 
and theoretical persuasion of the therapist, the available facilities, and other 
extraneous factors. The same behavioral symptoms may be treated quite 
differently by different therapists; and several types of therapy may be combined
in treating a single individual. There are very few forms of treatment
that are wholly specific to a given emotional disorder. In psychology, as contrasted
to medicine, one rarely if ever finds a disorder in which the diagnosis
directly determines the treatment. 

Evaluating the relative effectiveness of different forms of therapy for emo- 
tional disorders is very difficult. Some of the methodological problems presented 
by such evaluative studies will be illustrated in Chapter 15. For the present it 
should be noted that there is little conclusive evidence to demonstrate that one 
therapy is better than another for any given case, or even that any therapy is 
effective at all. Both clinical psychologists and psychiatrists are often in the 
position of applying a therapeutic procedure because they feel it will do no 
harm and may do some good. But there is no certainty about how well any 
given therapy will work or why. The rationale underlying different types of 
therapy varies from unknown to tentative. Since the practical demands of the 
situation call for some action, the therapist does the best he can within the 
available state of knowledge. 

In the present chapter, the different approaches to therapy will be considered
under four major headings: somatic therapies, environmental therapy,
psychotherapy, and the preventive and positive measures represented by a
comprehensive mental health point of view.


SOMATIC THERAPIES

All somatic therapies are administered by a medical practitioner or under his 
immediate supervision. In the use of these therapies, psychologists may col- 
laborate with physicians in several ways. The clinical psychologist may help 
in determining which patients are most likely to benefit from a particular 
therapy and in evaluating progress in the course of therapy. Sometimes psy- 
chotherapy, conducted by a psychologist, is combined with somatic therapy. 
In some cases, in fact, somatic therapy may be administered chiefly to make the 
patient more amenable to psychotherapy. Perhaps the most important contribu- 
tion psychologists can make to the development of somatic therapies is in the 
area of research. The special methodological controls and experimental designs 
required in this type of research provide a fruitful field for the psychologically
trained investigator. It is chiefly as background for some of the research problems
to be considered in Chapter 15 that somatic therapies are introduced in
this section. 

Somatic therapies are themselves quite varied, being grouped together only 
because of their common medical nature (see 38). The introduction of a new 
somatic therapy usually arouses widespread optimism and exaggerated expecta- 
tions, only to be followed by skepticism and a more sober evaluation. Some 
therapies that were popular one or two decades ago have been largely replaced 
by newer techniques. A few are currently in an exploratory stage. For example,
some research has been conducted on the effects of nutritional therapy, in- 
volving the experimental administration of glutamic acid, selected vitamins, 
or other nutritive substances to both mentally retarded and emotionally dis- 
turbed subjects (3, 87). The results, although suggestive, are still inconclusive 
and controversial. 

In the practical application of any somatic therapy, one must consider the 
likelihood of undesirable side effects, of either a physiological or psychological 
nature. Some side effects have been demonstrated with all the commonly used
somatic therapies, although the incidence and severity of such effects vary 
with the individual patient, the dosage level, and other circumstances. Little is 
known, however, about the specific patient variables that determine the rather 
pronounced individual differences found both in side effects and in responsive- 
ness to the various somatic therapies. Among the major somatic therapies are 
psychosurgery, shock therapy, and the many therapies covered by the broad 
area of psychopharmacology. 

Psychosurgery. The results of animal experiments on the extirpation of 
brain tissue, as well as the observation that human personality changes some- 
times followed brain injuries or the removal of frontal lobe tumors, led some 
neuropsychiatrists to try brain surgery in the treatment of emotional disorders 
(see 23, 38, 44, 90). Psychosurgery is a general name for the application of 
surgical techniques in treating functional, as contrasted to organic, disorders 
(see Chapter 13). Several specific procedures have been developed for this 
purpose. The original technique, commonly known as “lobotomy,” involved 
the severing of the corticothalamic tracts that connect the thalamus with the 
frontal lobes of the cerebral cortex. Later variations limited the operation to 
more narrowly circumscribed areas and employed simpler surgical procedures. 

Various combinations of neurological and psychological mechanisms have 
been proposed to account for the therapeutic effects of psychosurgery (see 90).
Available evidence in support of any of these explanations is meager. The most 
clearly demonstrated effects are a reduction in anxiety and in emotional tension. 
Such improvement, however, may be accompanied by one or more undesirable 
effects, such as apathy, emotional flatness, decrease in drive and energy, ir- 
responsibility, aggressiveness, poor judgment, impaired skill in problem solv- 
ing, increase in perseverative responses, and loss in spontaneity. Such be- 
havioral effects have been demonstrated through clinical observations of human 
patients, animal experiments, or both (90). Today psychosurgical techniques
are considered only when other, less drastic forms of therapy have failed. 

Shock Therapy. Under the general category of shock therapy are grouped 
a number of procedures for artificially inducing comatose or convulsive states. 
The specific techniques were usually suggested by accidental observations made 
either in the treatment of other disorders in human patients or in the course of
animal experimentation. Although several hypotheses have been formulated to
explain the rationale of such therapies, in either physiological or psychological 
terms, none has been conclusively established (14, 79). The principal agents 
currently employed in shock therapy are insulin and electricity. 

Insulin coma therapy (38) is employed mainly with schizophrenics. In this 
treatment, insulin is administered in large enough doses to cause severe hypoglycemia,
or sugar deficit in the blood. When the brain is thus deprived of its
chief food, the individual sinks into a coma. He is allowed to remain in this 
condition for periods ranging up to about an hour, at which time the coma is 
terminated by the administration of sugar. The treatments are usually repeated 
daily over a few weeks. As the patient recovers from each comatose period, he 
exhibits confusion, amnesia, perceptual disorientation, and other disturbances, 
which gradually disappear. 

In electroconvulsive therapy (ECT), convulsive seizures are induced by 
passing an electric current through the cerebral cortex (14, 38, 79). The patient
feels no pain, since the electric shock produces unconsciousness immediately. 
The seizure lasts about a minute, but the patient usually remains stuporous for 
about an hour. Treatments are generally administered several times a week, the 
total duration of the series depending upon the patient's response to treatment 
and his rate of improvement. ECT is used chiefly with cases in which depres- 
sion is either the principal feature or a major symptom. Severe impairment of 
memory is an immediate effect of the treatments but lessens gradually. Whether 
there is any permanent cognitive loss as a result of ECT or other types of 
shock therapy is difficult to ascertain because of inadequate controls in most 
investigations with human patients. Animal experiments, however, suggest that 
some permanent cognitive changes may occur (see 14, 79). In the decision to 
utilize any somatic therapy, the therapist must weigh the possibility of perma- 
nent deleterious side effects against the chances that the patient will remain 
in a seriously disturbed condition or even deteriorate if untreated. 

Psychopharmacology. Drugs have been used in the treatment of mental
disorders in many ways and for a long time (see 15, 38, 83, 84). Mention has 
already been made of insulin coma therapy. Prior to the introduction of electro- 
shock, several drugs were also employed to induce convulsions. A different 
application of drugs is illustrated by narcoanalysis, in which psychotherapeutic 
techniques are introduced while the patient is in a narcotically induced trance. 
The relaxed state produced by the drug permits the patient to relive traumatic 
experiences that would otherwise have provoked too much anxiety.

Still another example is provided by the use of drugs in research on the
nature of mental disorders, conducted with either animal subjects or normal
volunteers (83; 84, ch. 1). Certain drugs, when administered to normal
individuals, produce temporary symptoms similar to those found in psychotic
patients. Because they "mimic" psychotic symptoms, these drugs have been
named psychotomimetic. Since hallucinations are among their usual effects,
they are also called “hallucinogens.” Considerable research has centered around
lysergic acid diethylamide (LSD), which induces schizophrenic-like reactions,
including disturbances of perception and thought, hallucinations, and mood
changes.

Of all the psychoactive drugs, those that have aroused the greatest
interest are undoubtedly the so-called tranquilizers. In general, the effect of
such drugs is to reduce anxiety and agitation. They are now widely used in
mental hospitals to calm hyperactive and aggressive patients. As a result, there
has been a sharp decline in the need for physical restraint, isolation, locked 
wards, and other traditional methods for controlling highly disturbed patients. 
It has also been reported that tranquilizers serve more specifically to counteract 
delusions, hallucinations, and other cognitive disorders (84, ch. 2). 

Figure 100 shows the results of a five-year survey conducted in the state 
mental hospitals of New York (9, 10). The findings are typical of those 
obtained in other American state hospitals following the introduction of 
tranquilizers. It will be seen that, as the number of patients put on tranquiliz- 
ing drugs rises, the number requiring restraint drops sharply. Beginning in the 
second year of drug therapy, the total state hospital population also declined. 
This decline resulted from an increase in the number of patients discharged as 
improved, rather than from changes in number of admissions. Representing 
the first reversal in the rising trend of American state mental hospital popula- 
tions, this finding was particularly dramatic. 

While indicating that the introduction of tranquilizers into overcrowded and 
understaffed mental hospitals was undoubtedly a wise move, this type of survey 
fails to identify the specific factors that brought about the improvement. The
investigators themselves point out that during the last three years of the survey
conspicuous administrative changes were made in the state hospitals. That these
changes may themselves have been of primary therapeutic value will be seen
in the following section, dealing with environmental therapy. Even prior to
the official innovations in hospital organization and procedures, the use of
tranquilizers probably affected the attitudes of the hospital staff toward the
patients. As patients become easier to manage, the behavior of the hospital
personnel toward them takes on more constructive and therapeutic qualities.
It has been observed, in fact, that the administration of tranquilizers to mental
hospital patients may improve the behavior of the staff as much as that of the
patients (35, p. 39).

Fig. 100 Trends in adoption of tranquilizers, use of restraint, and total
patient population in New York State mental hospitals, 1954-1959.
(From Brill and Patton, 10, p. 498.)

Surveys such as the one cited illustrate forcefully the distinction between 
applied and basic research. Although the results are clear with regard to the 
introduction of tranquilizers within a specific context, with its peculiar complex 
of circumstances, they tell us little that can be generalized to other situations 
and other sets of circumstances. For the latter purpose, we need the highly 
controlled experimental designs to be considered in Chapter 15. In this con- 
nection it is relevant to note that tranquilizers have failed to produce such 
dramatic effects in other countries where the shift to therapeutic environments 
in mental hospitals had antedated the advent of tranquilizers (18, pp. 5-18; 
35, p. 47; 83, pp. 681-682).

Any statistical survey of hospitalized populations, moreover, presents a number
of methodological difficulties that preclude conclusive interpretation of
results. It is possible, for example, that various social and economic developments
may have reduced the total number of psychotics available for institutionalization.
Such a decline may fail to show up in admission rates if more
and more patients with milder disorders are admitted. Since milder disorders
tend to have a higher recovery rate, however, such a trend could account at
least in part for the increasing proportion of discharges and the resulting drop
in mental hospital populations (56, 96).
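
The arithmetic behind this possibility can be made concrete with a small sketch. All figures below are hypothetical and are chosen only to illustrate the reasoning; they are not taken from the New York survey or from any other study. The recovery rate within each type of case is held constant, and only the proportion of milder cases among admissions is changed.

```python
def overall_discharge_rate(n_mild, n_severe, rate_mild, rate_severe):
    """Discharge rate for a mixed admission group with fixed per-type rates."""
    discharged = n_mild * rate_mild + n_severe * rate_severe
    return discharged / (n_mild + n_severe)


# Hypothetical figures: total admissions stay at 1,000 in both periods,
# and the recovery rate of each type of case does not change.
earlier = overall_discharge_rate(n_mild=200, n_severe=800,
                                 rate_mild=0.75, rate_severe=0.25)
later = overall_discharge_rate(n_mild=500, n_severe=500,
                               rate_mild=0.75, rate_severe=0.25)
```

Under these assumed figures the overall discharge rate rises from 35 to 50 per cent, although admissions are unchanged and no type of case responds any better than before. A rise in discharges of this kind could therefore be mistaken for an effect of treatment.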

Even the most enthusiastic exponents of tranquilizers recognize that not all 
types of cases are helped by these drugs. Among those who generally fail to 
improve or are adversely affected are depressed, apathetic, or withdrawn
patients and deteriorated, passive schizophrenics (84). Another limitation in 
the application of tranquilizers arises from the occurrence of a number of 
adverse side effects, of both a medical and a behavioral nature. The latter 
range from sleepiness and drowsiness to severe motor symptoms including 
tremors and extreme stiffness. Little is as yet known about possible long-range 
or irreversible effects. All side effects, it should be noted, vary with the specific 
drug employed, the dosage, and the individual to whom it is administered. 
With small doses of the milder tranquilizers, side effects are negligible for
most persons. From another angle, it has been cautioned that the indiscriminate
use of tranquilizers may suppress anxiety below a desirable minimum and may
thus reduce social sensitivity, ambition, and the restlessness that spurs one to
constructive action.

There is some evidence to suggest that the difference in the effects of
tranquilizers and of sedatives (such as the barbiturates) is one of degree (40).
In larger doses, tranquilizers affect cognitive and psychomotor performance,
alertness, and other psychological functions in much the same way as do
sedatives. An important advantage of tranquilizers, however, is that they are
not habit-forming to any significant degree. There are almost no cases of
addiction to any of the tranquilizing drugs, although with very large doses of
some tranquilizers withdrawal symptoms have been noted (84, ch. 7).

Mention should also be made of the more recently developed "psychic energizers,”
or antidepressants (84, ch. 3). These drugs are believed to be useful
in the treatment of depressed patients. They should not be regarded as the 
opposites of tranquilizers, however, since the two types of drugs have several
effects in common. Moreover, tranquilizers themselves do not represent a homo- 
geneous class, since individual tranquilizers vary in many ways. The classifica- 
tion of such psychoactive drugs is not so simple as their popular names imply. 
Each drug needs to be examined individually with respect to therapeutic value, 
side effects, and other properties. 

Despite the waves of enthusiasm that have greeted each new somatotherapy, 
there is no one "miracle cure" for mental disorders. The various somatic 
therapies described above continue to be used with different cases, the choice of 
specific therapy depending upon the nature of the disorder, individual patient 
characteristics, and other circumstances. Several therapies may be tried in turn 
with the same individual. Since so many mental disorders originate in the
individual's inability to cope with problems of daily life, somatic therapies are 
coming to be regarded more and more as adjuncts to psychotherapy. The 
psychoactive drugs, even more than earlier forms of somatic therapy, are used 
increasingly to render the withdrawn, uncommunicative, unresponsive, agi- 
tated, or aggressive patient accessible to psychotherapy. It should be borne in 
mind that to reduce the anxiety arising from unsolved problems is not in itself 
a solution. 


ENVIRONMENTAL THERAPY 


Environmental therapy covers any manipulation of the client's environment 
outside of the formally scheduled therapeutic session. Its object is to improve 
the individual's behavioral adjustment by modifying physical or interpersonal 
aspects of his surroundings, or both. Environmental therapy may be applied 
within the ordinary contexts of daily life, such as home, school, and job, or 
within an institutional setting. Under the latter category are included some 
conspicuous innovations that are rapidly altering the psychological climate of 
mental hospitals. 

Environmental Therapy in Daily Life. Therapeutic environmental modi- 
fications are most commonly utilized with children, although in certain situa- 
tions they are also useful as supplementary procedures with adult clients. 
Therapists differ in the extent of their active intervention in the client's out- 
side activities. Nondirective therapists, for example, maintain that any efforts 
by the therapist to influence the client’s outside environment would interfere 
with psychotherapy. Hence they limit their activities to the therapeutic hour 
itself. Some psychotherapists, on the other hand, make extensive use of environ- 
mental manipulation. Environmental therapy alone is rarely enough, however, 
since part of the difficulty usually lies within the individual. In some forms of 
psychotherapy, the client is given specific therapeutic tasks to perform or roles 
to assume outside of the therapeutic hour and is expected to report on the 
results during the subsequent session. 

Environmental manipulations undertaken for therapeutic purposes may cover 
any phase of daily living. To reduce stress and provide a more wholesome 
milieu, the therapist may recommend a change of job or possibly a short
vacation for an adult client. For a child, he may arrange a transfer to a different
school or to another class within the same school. When the child's home is 
highly unfavorable and resistant to improvement, foster home placement may 
be indicated as a last resort. In other cases, the individual may be guided into 
recreational activities specially suited to his needs. Joining a club, going to 
camp, participating in sports, or cultivating a hobby might each have thera- 
peutic value for a given individual. 

An important aspect of environmental therapy involves counseling with 
significant persons in the client's life, such as parents, spouse, teachers, and 
employers. Modifying the attitudes of these associates toward the client and 
giving them a better understanding of his problems may go a long way toward 
reducing stress and interpersonal conflicts. In psychotherapy with children, it 
is customary to require one parent—and sometimes both—to meet regularly 
with a therapist. The latter may not be the same therapist who works with the 
child, but in that case both therapists work closely together. It sometimes hap- 
pens that the parents are more in need of psychotherapy than the child and 
that the child's problems disappear when the parents resolve their own emo- 
tional difficulties. 

Some environmental modifications may require the specialized resources of 
social service agencies. In environmental therapy, psychologists often collaborate 
closely with social workers. Furthermore, certain types of adjustment problems 
are commonly handled by social workers themselves, whose approach is char- 
acteristically that of environmental manipulation. Psychiatric caseworkers are 
social workers specially trained to work with maladjusted individuals. 

Institutional Environments.  Institutionalization itself may be regarded as 
a form of environmental therapy. In an institution, it is possible to manipulate 
all aspects of the individual's environment for 24 hours a day. Until quite 
recently, however, institutions have been so organized as to serve principally a 
custodial rather than a therapeutic function. With their overcrowding and 
understaffing, their emphasis upon security and restraint, and their deadening 
inactivity, state mental hospitals have often contributed to the deterioration of 
chronic cases in the “back wards.” Sitting idly on benches from morning till 
night is not therapeutic. The longer the individual lives in such a psychological 
vacuum, the farther he retreats from reality.

Today the atmosphere of mental hospitals is rapidly changing, as the concept
of a therapeutic community replaces that of a custodial institution (18;
27; 29, pp. 218-222; 35, ch. 2; 57). This approach was first explored in institutions
for children and adolescents, where the possibility of re-education and
rehabilitation was more readily apparent (see, e.g., 7; 18, pp. 11-13). The
decade of the 1950s saw its large-scale application in American state mental 
hospitals. Tranquilizers undoubtedly contributed much to the ensuing
improvements in both staff attitudes and administrative practices. Locked wards
were gradually abolished, and the "open hospital" became standard. Under
these conditions, most patients became more tractable and less aggressive, and
their general behavior improved rather than deteriorated. To what extent
these changes resulted from the use of tranquilizing drugs and to what extent
they resulted from institutional reorganization cannot be readily determined.
It is certainly true that tranquilizers were introduced in American hospitals at
a time when the mental health professions were ready for the many administrative
innovations that were facilitated by these drugs.

The inactivity and custodial atmosphere typical of earlier institutions are
giving way to milieu therapy. Essentially this therapy is an attempt to utilize 
all the patient's daily experiences for therapeutic purposes. Considerable em- 
phasis is placed upon the cooperation of all hospital personnel who come in 
contact with patients, from the director of the hospital to ward attendants and 
janitors. The attitudes of staff members toward patients are a major aspect of
milieu therapy. In this regard milieu therapy reflects the therapeutic utilization 
of interpersonal relations, which is characteristic of the chief forms of psy- 
chotherapy to be discussed in the next section. 

Activity on the part of the patient is likewise stressed. In one form of milieu 
therapy, known as "total push" therapy, constant and varied activity is the 
predominant feature, even somatic therapies of various sorts being introduced 
largely for their psychological value (see 29, pp. 218-219; 65). Occupational,
or work, therapy has long been recognized as an adjunct to other forms of treat- 
ment. In milieu therapy, it assumes more importance and becomes more highly 
diversified. Patients are given training and experience in vocational activities 
suited to their individual abilities. Working in sheltered workshops that are 
conducted along realistic lines affords direct preparation for returning to the 
outside community. Paying the patient for work he does while in the institu- 
tion may have good motivational value and is a common practice in European 
hospitals. Recreational and social activities, like parties, outings, cookouts, 
motion pictures, art, music, and sports, are also part of the program. These 
activities provide practice in interpersonal contacts, as well as opportunities to 
be creative, to experience success, and to obtain other satisfactions. Trips and 
visits to theaters, museums, and other recreational resources help to reestablish 
contact with the outside community. 

Treating each patient as an individual and boosting his morale by improving 
his own appearance and that of his surroundings are additional aspects of 
milieu therapy. Increasing emphasis is also being placed upon patient govern- 
ment, through regular group discussions, committees, and other techniques for 
patient participation in planning and decision making (18, pp. 129-146; 26). 

Although all the above developments are examples of milieu therapy, the 
essential characteristic of this type of therapy is the utilization of all hospital 
resources in an integrated therapeutic program. What is included in the 
therapeutic program may vary widely from one institution or one ward to 
another. Somatic therapies and psychotherapy are often incorporated in the 
total program. Ideally the content of the program should be adapted to the 
needs of the individual patient. For some, a highly structured environment with 
rigid schedules may be desirable; for others, maximum freedom in the choice 
and scheduling of activities may be most effective. The nature of the specific 
activities would obviously vary as a function of individual patient character- 
istics. From another angle, size of institution and patient-staff ratio are limit- 
ing factors in milieu therapy. 

In line with the current trend toward rehabilitation and resumption of 
normal community living, mention should also be made of the establishment 
of day hospitals, night hospitals, week-end hospitals, halfway houses, and other 
arrangements for partial institutional treatment (16; 18, pp. 163-176; 35, pp.
180-182; 42; 88; 93). In a day hospital, the patient reports for therapy from 9
to 5 on weekdays, while continuing to live in his own home. It is obvious that
such an arrangement implies the application of milieu therapy. There would
be no point in having patients come to the hospital to sit on benches from 9 to
5. Even less disruption of normal living is involved in the night hospital,
which enables the patient to continue with his job or other regular daytime 
activities, while reporting to the hospital for treatment in the evenings. De- 
pending upon his specific needs, the patient may remain overnight at the 
hospital or return home after the evening sessions. Halfway houses are de- 
signed to aid in the rehabilitation of former mental hospital patients and to 
facilitate their transition to community living. With developments such as 
these, the sharp distinction between inpatient and outpatient care is disappear- 
ing. Instead, there are now available a wide variety of therapeutic schedules, 
from a one-hour therapy session to complete hospitalization, which may be 
adapted to the needs of the individual patient. 


PSYCHOTHERAPY 


General Characteristics. In the New York State certification law for
psychologists, psychotherapy is defined as "the use of verbal methods in inter- 
personal relationships with the intent of assisting a person or persons to 
modify attitudes and behavior which are intellectually, socially, or emotionally 
maladaptive" (55, p. 50). Elsewhere psychotherapy has been described as "a
series of very specialized conversations with a skilled clinician" (69, p. 514). 
To be sure, psychotherapy is not always strictly verbal, especially when applied 
to young children; but it always involves some means of communication be-
tween therapist and client.

Regardless of their theoretical persuasion, practically all clinical psychol- 
ogists today agree that the interpersonal relationship between therapist and 
client is an important element in psychotherapy—if not the most important. 
The personality of the clinician and his behavior toward the client during the 
session are thus an integral part of the therapeutic process. It is also generally 
recognized that psychotherapy is a learning process. The individual acquired 
his maladaptive responses through past learning; psychotherapy offers him a 
method for unlearning these responses and replacing them with new responses 
of a more adaptive nature.

The nature of the client-therapist relationship was discussed
in Chapter 13, in connection with the question of rapport in clinical inter-
viewing. Essentially, this relationship can be characterized as non-
threatening, accepting, and permissive. In his interactions with the therapist,
the individual gradually discovers that, no matter what he says, he will not be
criticized, blamed, or rejected. The strict confidentiality observed also reassures
him that his behavior during the therapy hour will lead to no aversive con-
sequences in his daily life. The therapeutic relationship thus provides a “safe”
situation in which the individual can try out new interpersonal responses. In
this respect, the clinician serves the same function as the
simulators used in flight training. The learner can make mistakes without
danger to himself. The client's interaction with the therapist helps him to
understand his reactions to people in general and enables him to develop
new ways of thinking about himself and others.

Several psychologists have tried to explain the process of psychotherapy in
terms of learning principles (5; 20; 29, ch. 7; 50; 69, ch. 16; 71; 72; 94; 95).
Since psychologists differ in their theoretical orientation with regard to both
learning and psychotherapy, no one explanation would find general acceptance. 
It should also be borne in mind that therapeutic procedures developed largely 
on an empirical basis; it was only later that theoretical rationales were for- 
mulated to support them. A few examples will suffice to illustrate how learn- 
ing theory has been applied to psychotherapy. 

A major goal of all psychotherapy is the overcoming of anxieties and the 
elimination of defenses built up against these anxieties in the form of be- 
havioral symptoms. Within the context of a secure and emotionally satisfying 
therapeutic relationship, the client is encouraged to verbalize his anxieties and 
the situations that led to them. Talking about an anxiety-provoking situation 
approximates the experience of reliving it. The symbolic, verbal reinstatement 
of the situation arouses much of the anxiety associated with the situation itself 
(see 51). Through the processes of extinction and counterconditioning, the 
original responses to these situations tend to disappear and are replaced by the 
more positive affective responses of security and confidence aroused by the 
therapeutic situation. Through stimulus generalization, these newly acquired
responses tend to be evoked by similar situations subsequently encountered 
outside of therapy (69, ch. 16). The individual may also develop a generalized 
learning set that will make him better able to solve emotional and interpersonal 
problems in the future (47). 

In psychotherapy, timing is important. If too much anxiety is aroused early 
in therapy, conditioning may occur in the wrong direction. The therapeutic 
situation itself might thus become anxiety-provoking and intolerable for the 
client. Anxiety must be aroused very gradually and only after a sufficiently 
secure therapeutic relationship has been established. As the client overcomes his 
initial anxiety and learns to cope with it, he can safely progress to a considera- 
tion of situations evoking more severe anxiety. As anxiety is reduced, the 
individual is able to examine his problems rationally and work out more 
effective ways of handling them (20, ch. 14). 

In the course of therapy, the client’s self concept also tends to change in a 
favorable direction. Under the influence of the therapist's accepting attitude, 
the client experiences a growing sense of his own personal worth. Insofar as 
this response generalizes to other persons he encounters outside of the thera- 
peutic situation, the client will exhibit more confidence and security in inter- 
personal relations. There is evidence to suggest that feelings of personal worth 
tend to be accompanied by a more favorable evaluation of others (see, e.g.,
70, 81). These newly aroused attitudes on the client's part will in turn elicit
more favorable responses from his associates. Hence a cycle of positive be- 
havior and reinforcement is started in the individual's real-life experiences 
whereby the effects of therapy are further generalized. 

One might ask why verbalizations are used to mediate the conditioning that 
occurs during psychotherapy. Why not work directly with the maladaptive be- 
havior we are trying to eliminate? The answer is that we can, if the behavior 
is of the sort that can be readily evoked in the therapeutic situation. Extinction 
and counterconditioning have been successfully applied to the elimination of 
phobias, compulsions, motor tics, and other specific symptoms (see 5, 94, 95). 
In a classical early experiment (36), a child’s fear of furry objects was over- 
come by having a rabbit in the room while the child was eating. During the 
initial sessions, of course, anxiety had to be minimized by keeping the rabbit
caged and at a distance and by leaving him in the room only a short time.
After repeated sessions, fear responses were eliminated, not only to the 
rabbit, but also to other furry objects, through stimulus generalization. 

The traditional objection to such an approach is that it treats the "symptoms" 
rather than the “cause.” This objection, however, seems to be based on an 
inappropriate analogy with organic disease. A toothache is only an indicator 
of dental decay. But in behavior disorders the maladaptive responses them-
selves constitute the disorder. This distinction provides a further example of
the inapplicability of the disease analogue to psychological disorders (see 
Chapter 13). It is the object of therapy to change the maladaptive responses 
(see 5, 22, 50). Whether such behavioral changes can be effected more 
successfully when the responses are regarded as "symptoms" of some underlying 
adjustment mechanism remains to be demonstrated. These “underlying mecha- 
nisms” are, after all, unverified hypotheses contributed by the therapist, rather 
than observable facts contributed by the client. 

Scope of Psychotherapy. Many clinical psychologists set a number of 
limitations upon the type of client they will accept for psychotherapy (see 8, 
ch. 5; 20, ch. 15). The requirements vary considerably with the theoretical 
orientation of the therapist and the form of psychotherapy he practices. Be- 
cause of the heavy reliance upon verbal communication in most types of psy- 
chotherapy, minimum educational and intellectual levels are often specified. 
Background factors that would affect the client's motivation for therapy are 
sometimes considered. The various indices for predicting response to therapy 
that were cited in Chapter 13 are also relevant in this connection. Psycho- 
therapy has been traditionally employed with neurotics and with essentially 
normal persons who want help with some of their emotional problems. More 
recently, however, there has been a growing interest in the application of 
psychotherapy to schizophrenics and other severely disturbed cases (13, 17).
It will be recalled that somatic therapies have been used increasingly for the 
purpose of making psychotics accessible to psychotherapy. 

Surveys of the relationship between type of therapy employed and patient 
variables have consistently shown that socioeconomic level is an important 
factor (11; 32, ch. 9; 33; 64). The higher the individual’s socioeconomic level, 
the greater are his chances of being accepted for psychotherapy in outpatient 
clinics, of being assigned to a relatively experienced therapist, and of con- 
tinuing in therapy. Patients in lower socioeconomic levels are more likely to 
receive somatic or simply custodial treatment than psychotherapy. These differ- 
ences remain even when cost, availability of therapists, and similar extraneous 
factors are ruled out. 

Since socioeconomic level is itself related to educational and intellectual 
levels, the latter factors undoubtedly account in part for the more limited use of 
psychotherapy with lower-class patients. Another contributing factor is the dis- 
crepancy between the socioeconomic levels of therapist and client. The more 
similar the client's and therapist's cultural backgrounds are, the easier it will 
be for the therapist to understand and communicate with the client (11, 86, 
92). The concept of assumed similarity discussed in Chapter 13 may be re-
called in this connection. Still another reason is to be found in the motivations,
attitudes, and expectations of persons in different socioeconomic levels. Motiva- 
tion for therapy will tend to be higher when the patient has more to look 
forward to in terms of vocational achievement, social status, and other tangible
rewards of improved emotional adjustment (20, ch. 15). Persons in high
socioeconomic levels tend to perceive therapy as a means of maintaining status
in their immediate social environment (33). They also have a better under-
standing of psychotherapy and a more rationalistic approach to their emotional
problems than do persons in lower socioeconomic levels. The latter are more 
likely to perceive their emotional disorders as a disease for which they expect 
a quick-acting, somatic remedy (11; 32, ch. 11; 92).

So far we have concentrated upon the common features of psychotherapy. 
Even these features, however, are more characteristic of some forms of psycho-
therapy than of other forms. In the field of psychotherapy, "schools" still
thrive. There are currently many varieties of psychotherapy, each with its 
loyal adherents (8, 12, 30, 80). One recent survey listed 36 different systems 
(30). It is heartening to note, however, that the trend is toward eclecticism. 
Well-trained clinical psychologists recognize that different forms of psycho- 
therapy may be required for different clients and that no one form is uni- 
versally to be preferred. In a survey of the members of the APA Division of 
Clinical Psychology, conducted in 1960, 49 per cent of those engaged in 
psychotherapy described their orientation as “eclectic” (45). Like its clients, 
psychotherapy itself becomes better integrated as it grows toward maturity. 

To illustrate more fully what psychotherapists do, we shall briefly examine 
in the next two sections two of its most widely practiced forms, namely,
psychoanalysis and client-centered therapy. Although psychotherapy is pre-
dominantly individual and verbal in its techniques, two important procedural
variants are group therapy and play therapy. Each may be employed by clini- 
cians with widely varying theoretical orientations. These procedures will be 
considered in the last two sections. 

Psychoanalysis. The theory and techniques of psychoanalysis originated 
during the last quarter of the nineteenth century in the work of Sigmund 
Freud, a Viennese physician who became interested in the treatment of 
emotional problems (see 52, chs. 22 and 23). Freud's approach to these prob- 
lems still provides the general framework for psychoanalysis, although many 
changes in specific therapeutic procedures and in theoretical rationale have 
been introduced (see 20; 29, ch. 4; 69, ch. 14). The principal object of psy- 
choanalysis is to uncover repressed impulses and forgotten experiences that are 
believed to underlie present behavioral symptoms. The client is thereby given 
an insight into possible causes of his maladaptive behavior. He is also enabled 
to relive emotionally the incidents in which the repression occurred. As he 
becomes aware of his repressed impulses in a context of reduced anxiety, he is 
able to bring them under rational control. In the classical psychoanalysis of 
Freud, the origins of emotional difficulties were traced to early childhood 
experiences. Among neoanalysts, the causes of symptoms are more often sought 
in the individual’s present interpersonal relations. Partly for this reason, the 
newer psychoanalytic therapies generally require less time than classic psycho- 
analysis, a condition which makes them applicable to a wider range of clients. 

Among the special techniques employed by psychoanalysts are free associa- 
tion and dream analysis. In the former, the client is asked to report everything 
that comes to mind; nothing is to be held back, however irrelevant, absurd, or 
embarrassing it may seem. To facilitate such free association, the client tradi-
tionally reclined on a couch, with the analyst seated outside his field of vision.
Today the procedure is more flexible: the client may sit, lie down, or walk
around as he wishes. Although the couch has remained the symbol of psycho-
analysis in popular thinking, some contemporary analysts have discarded it
altogether. Dream analysis is utilized by psychoanalysts as another source of 
cues to unconscious or repressed impulses, which, they maintain, appear in 
dreams in disguised and symbolic form. The dreams reported by the client may 
be interpreted directly by the analyst in terms of common psychoanalytic 
symbols, or they may serve as the starting point for further free association by 
the client. 

With regard to the client-therapist relationship, psychoanalysts emphasize the 
process of transference. By "transference" is meant that in the course of therapy 
the patient identifies the therapist with a parent or some other important person 
in his life and projects upon him the emotions he feels toward that person. 
These feelings may be positive, such as dependence and love, or negative, 
such as hostility and antagonism. The therapist uses the transference relation- 
ship as a device to help the client relive his earlier emotional experiences. As 
the treatment progresses, the transference itself is gradually terminated by 
explaining its source to the client. 

It is apparent that active interpretation by the therapist is a major feature 
of psychoanalysis. The analyst interprets the client's dreams, his casual re- 
marks and slips of the tongue, the apparently irrelevant content of his free 
associations, and the feelings the client expresses toward the analyst himself. 
When resistance is encountered in free association, he interprets it as evidence 
of repression and calls the client's attention to the defenses he is using against 
anxiety-provoking content. By bringing together disparate bits of information 
that the client provides, the analyst offers him an explanation of the source 
of his symptoms. In building up such an explanation, the psychoanalyst 
asks probing questions to elicit additional facts. All too often the inquiry may 
take the form of what Sarbin and his associates describe as “soliciting” rather 
than “probing” (see ch. 13). There is thus a danger that the resulting ex- 
planation will reflect the psychoanalyst's theoretical views more closely than 
it does the personality dynamics of the particular client. 

In its early forms, psychoanalytic theory was expressed largely in terms of 
crude physical analogies and figures of speech. Most of its propositions were 
so formulated as to be untestable. For instance, if a client agreed with the 
analyst's interpretation of a symptom, that was regarded as supporting evidence 
of the correctness of the explanation; if he disagreed with it, his reaction was 
attributed to repression of anxiety-evoking material. With such a highly sub- 
jective approach, it is little wonder that different schools of psychoanalysis 
quickly developed, each with its highly partisan protagonists.

In its later evolution, psychoanalysis has moved closer to the science of 
psychology. Its language has become less fanciful, and its claims are more 
often verifiable—even though still largely unverified. A promising beginning 
has been made by some psychologists in their attempts to explain psycho-
analytic concepts and procedures in terms of learning theory (see, e.g., 20).
Such reformulations make it possible to design experiments to test psycho- 
analytic hypotheses. Examples of this type of research will be considered in 
Chapter 15.

Client-centered Therapy. Also known as “nondirective counseling,”
client-centered therapy was first described in 1942 by Carl R. Rogers, an
American psychologist (58, 59, 60). In contrast to psychoanalysis, client-
centered therapy reduces interpretation by the therapist to a minimum (82).
The basic postulate of client-centered therapy is that the individual has the 
capacity to identify the sources of his emotional problems and to work out 
effective solutions, once he is freed from disabling anxieties and feelings of 
insecurity. To accomplish this goal, the client-centered therapist establishes a 
permissive, accepting, non-threatening therapeutic relationship. He refrains 
from probing, interpreting, advising, persuading, or suggesting. He serves the 
function of an active listener, trying to understand fully what the client says 
and feels, and making every effort to perceive situations from the client's 
point of view. 

An important part of the therapist’s task is to reflect and clarify the client's 
feelings by restating the client's remarks. This clarification of feeling is illus- 
trated in the following excerpt from a recorded interview with a young woman 
of twenty. The excerpt begins near the end of the first interview (77, pp. 139- 
141). 


Cl. I realize, of course, that it all began a long time ago—because everything 
begins somewhere. I wasn't just—somehow or other something failed somewhere 
along the line. And I guess we sort of have to get at it, a sort of re-education. But 
I don't feel as though I can do it myself. 

Th. You realize that the roots must go a long way back, and that at some point 
you will have to start in reworking it, but you're not sure whether you can do it. 

Cl. That's right. (Pause) It's just the idea that I can see myself going through
life this way, fifty, sixty, and seventy years old—still thinking these horrible 
thoughts. And it just doesn't seem worthwhile—I mean, it's so ridiculous. While 
everybody else is going their way and living life, I'm sort of at the edge, and look- 
ing on. It just isn't right. 

Th. The future doesn't look very bright when you look at it that way. 

Cl. No. (Long pause) I know I’m lacking in courage, that's the big thing I'm 
lacking. That must be it, 'cause other people aren't swayed so easily. It's a funny
thing, though—when I think of those—those qualities, I always think of them, I 
don’t know, not as realities, but as something that’s far off somewhere. It’s a hard 
thing to explain these things. It’s just as though—it's—it's true but I laugh at it in 
a way. Sort of a feeling that I am sort of sneering at it—but I know it must be true 
'cause other people go around expressing those things. It’s a very confused feeling.

Th. Logically, you realize that courage is one of your deficiencies, but inside 
yourself you find yourself laughing at that notion and feeling that it doesn’t really 
have anything to do with you. Is that it? 

Cl. That's right. I always sort of make myself different. That's it. 

Th. M-hm. You sort of say that might apply to other people but it's not for 
you because you're different. 

Cl. I don't know whether that's exactly right or not. I can't put my finger on
it. Sometimes I feel lonely and sometimes I feel another way. Do you have cases

Th. You really wonder whether anybody else could be— 

Cl. I think I'm worse than anybody that I know. That's just it. I feel as though
I am terribly, terribly low. It just does not seem worth—bothering with it, it doesn't
seem worthwhile, that I can't get up there to first base.

Th. You think about making the struggle, but it doesn't seem possible. 

Cl. That’s right. I just wonder what other people do when they find problems 
and stuff. I just wonder whether they see it through or try to find out some-
thing else.

Th. You feel that you'd like to know how somebody else would handle it. 

1 Cl. stands for “client,” Th. for “therapist.”

Through systematic analyses of recorded interviews, client-centered thera-
pists have conducted extensive research on the therapeutic process itself (61,
67, 70, 76, 81). These studies indicate that in the course of client-centered
therapy there is an increase in the number of statements showing insight and 
self-understanding, as well as in statements regarding client plans and de- 
cisions. Similarly, a comparison of early and late therapeutic sessions shows 
an increase in number of statements expressing positive attitudes toward self 
and a decrease in statements expressing negative attitudes toward self. The 
individual's self concept comes to resemble his ideal concept more closely in 
the course of therapy. Acceptance of others and respect for others also in- 
crease with the growing acceptance of oneself. The methodology of these 
studies and their broader implications regarding the effectiveness of therapy 
will be considered in Chapter 15. 

Group Therapy. Although group activities and group discussions have been
utilized therapeutically in different settings since the beginning of the cen-
tury, the widespread adoption of current methods of group psychotherapy
dates from World War II (29, ch. 10; 31; 39; 54; 74). First introduced as a 
means of coping with a shortage of therapists, group therapy was soon seen 
to have a number of intrinsic advantages. In its most usual form, it has much 
in common with individual psychotherapy. A group of 5 to 10 persons meet
once or twice a week for several months to discuss their emotional problems. 
The therapist, who serves as discussion leader, tries to create a permissive 
atmosphere that reduces resistance and defensiveness. Each participant 
describes his own experiences and difficulties as he feels inclined and com- 
ments upon the experiences of other group members. An informal atmosphere 
is maintained, with group members sitting around a table or in some other 
casual grouping. 

Most persons are eventually able to relive emotionally charged experiences as 
well in group therapy as they can in individual therapy—and sometimes better. 
Recognizing that others have similar problems enables them to consider their 
own problems more objectively. Participation in group therapy helps to break 
down feelings of isolation so common among emotionally disturbed persons. It 
also gives the individual a more extensive and realistic field in which to 
practice interpersonal behavior than is provided by individual psychotherapy. 

To insure maximal effectiveness, participants in group therapy are usually 
screened for various characteristics, and groups are assembled with special 
reference to age and sex distribution, type of problems, and other relevant 
variables. Group therapy may be employed exclusively or in combination with
individual psychotherapy. When the two approaches are combined, they 
may run parallel or individual psychotherapy may be used in advance to 
prepare the client for group therapy. 

There are fully as many varieties of group as of individual psychotherapy. 
The therapist may play a very active role in steering discussion and interpret- 
ing member reactions; or he may be nondirective, limiting his remarks to 
acceptance, restatement of content, and clarification of feelings. Psychoanalyti-
cally oriented group therapists utilize transference reactions, which develop
not only toward the therapist but also toward other group members. 

Group therapy has been employed with special groups, such as members
of a single family, mothers of emotionally disturbed children, or former
patients who [...]
experiences. Such well-known national groups as Alcoholics Anonymous (43)
and Recovery, Inc. (89) employ some form of group therapy. The latter 
organization was founded in 1937 by recovered patients who had been treated 
at the Psychiatric Institute of the University of Illinois Medical School. 
Group therapy may also be combined with other group activities. In one
Veterans Administration hospital, for example, a group living program was 
introduced, in which small, family-like groups of schizophrenics participated 
for several months in a joint program of social, educational, recreational, and 
work activities, as well as in group psychotherapy (91). Results were promis-
ing when compared with those of somatic therapies and of a traditional insti-
tutional activity program.

Psychodrama (48) is a variant of group therapy. In this technique, the 
client acts out unrehearsed parts on a stage, with other clients in the audience. 
The therapist usually assigns roles, directs the performance, and interprets the 
individual's behavior. Supporting parts are played by therapeutic assistants or
by other clients. The individual may play himself in realistic situations from 
his daily life; he may play the roles of other persons in his life, such as his 
father; or he may improvise imagined roles. Psychodrama is designed to pro-
vide emotional release as well as self-insight. Role playing is a simplified ver-
sion of psychodrama which does not require a theatrical setting or special
props. It is used with clinical groups, as well as in certain types of training, 
some of which were illustrated in Chapter 5. 

Play Therapy. Because children have difficulty in verbalizing their emo- 
tional problems, play therapy is often employed in either individual or group 
form (4, 21, 25, 49). In the play session, the child has an opportunity to act 
out his anxieties, hostilities, aggressions, and other disturbing feelings. Among 
the media utilized for this purpose are puppets, dolls, toy furniture, clay, sand, 
and finger paints.

Therapy sessions are usually conducted in a room that is as indestructibly
furnished as possible, and the child is allowed to do what he wishes, within 
broad limits. Through his accepting and nonthreatening manner, the thera- 
pist establishes a warm, secure interpersonal relationship with the child, just 
as he does in the verbal interaction with adult clients. In this safe context, the 
child obtains emotional release from his manipulations of the play materials. 
With the aid of interpretations or clarification by the therapist, the child may 
also gain insight into his own motives and feelings. 

Figure 101 shows a little girl caring for a baby doll in play therapy. Using
dolls that represent mother, father, and siblings, the child can enact family 
themes revealing attitudes that she would have difficulty in communicating 
verbally. 


MENTAL HEALTH MOVEMENT 


The mental health movement is a broad social movement that has been 
gaining momentum since World War II. Its recent upsurge is characterized 
by a growing public awareness of mental health problems, governmental sup-
port of mental health activities, and the cooperative efforts of several
professional specialties, including medicine, psychology, and social work. A
significant development within this movement was the enactment of the
Mental Health Study Act of 1955 by the Congress of the United States and
its implementation by the Joint Commission on Mental Illness and Health
(35). With the help of 36 participating organizations representing many
disciplines, this Commission conducted extensive investigations of the coun-
try's mental health problems and available resources for meeting them and
submitted a set of recommendations for action.

Fig. 101 A child using dolls in play therapy. (From Ginott, 25, p. 54.)

Although the term “mental health” has misleading connotations, it has 
become so strongly associated with the movement that it would be difficult 
to replace it with a more precise expression, such as “effective behavior” (2, 
75). It seems more expedient to redefine the term within its present context. 
The current mental health movement is characterized not only by renewed
efforts to treat and rehabilitate severely disturbed cases but also by an in- 
creased emphasis on prevention, on the development of positive mental health, 
and on a comprehensive approach that encompasses all aspects of daily life. 
Because of its special concern with preventive measures and its broad social 
orientation, the mental health movement is logically an aspect of the general
public health movement.

Prevention. One of the objectives of the mental health movement is to
increase facilities for identifying and treating mild and incipient disorders.
Known as “secondary prevention,” this approach tries to reach the individual
before he develops more serious difficulties. Community mental health clinics
for outpatient treatment are an important resource for this purpose. The Joint
Commission report (35, p. xii) calls attention to another potential source of
help for referral and for direct handling of minor problems, namely, the
persons to whom individuals in trouble generally turn and those in a position
to spot trouble in its early stages. Examples include clergymen, family physi-
cians, teachers, public welfare workers, scoutmasters, and many others. Short
training courses and the availability of professionally trained mental health 
consultants would enable these persons to function more effectively in their 
role of “mental health counselors.” 

An exploratory survey conducted in a lower-class district in New York 
City was concerned with the ability to recognize mental illness and the 
recommendations and referral practices of community leaders in various fields 
(19). A number of significant differences were found among leaders in differ-
ent fields, such as religion, education, business, and politics. Although neces-
sarily limited in its findings, this study illustrates the kind of information that
is needed in planning for the better utilization of community leaders in mental
health activities. 

Still another way to encourage secondary prevention is to modify public
attitudes toward emotional disorders—an admittedly difficult task. Several 
well-conducted attitude surveys have demonstrated that emotionally disturbed 
persons still encounter widespread social rejection (35, ch. 3; 53). By the very 
nature of their difficulties, such persons lack social appeal. They are likely to 
be uncooperative or hostile even toward those who try to help them. Their 
behavior often estranges them from family and friends. Their social rejection 
engenders further withdrawal and hostility, which in turn lead to more re- 
jection in a continuing cycle. Breaking this cycle, it will be recalled, is one 
of the objects of psychotherapy. Unfortunately, the social stigma that still 
attaches to “mental illness” makes the mildly troubled individual reluctant to 
seek professional help and makes his associates hesitate to recommend it. That
attitudes toward the mentally disturbed can be altered, however, is illustrated 
by the excellent work done through the college student volunteer program 
with highly deteriorated state hospital patients (35, pp. 88-93; 85). 

A more direct attack, of course, is through primary prevention, i.e., the
prevention of even minor difficulties that need professional help. This objective 
requires that the experiences of persons in general—especially in childhood 
—be so structured as to strengthen their ability to withstand the strains and
stresses of daily living. It has been repeatedly observed that mental health is a 
way of reacting to problems rather than an absence of problems (6; 28; 35, 
ch. 4; 75). Happy, well-adjusted persons may have many worries, but they 
know how to cope with them.

Positive Mental Health. There is an increasing interest in the positive
goal of attaining mental health as contrasted to the purely negative goal of 
avoiding mental illness (34, 37, 75). The range of persons that the mental 
health movement tries to reach is thus greatly extended. It is clear that many 
essentially normal persons have emotional problems with which they can be 
helped. Surveys of random samples of the general population have repeatedly 
revealed a high incidence of emotional problems (28, 78). In one of the 
studies sponsored by the Joint Commission, for example, a representative 
nationwide sample of American adults living at home was intensively inter- 
viewed (28). In this group of approximately twenty-five hundred persons,
nearly one respondent in four said he had at some time had an emotional
or interpersonal problem for which professional help would have been useful.
Only one in seven, however, had actually sought help. And, of these, 42 per
cent had consulted clergymen, 29 per cent general physicians, 18 per cent
psychiatrists or psychologists, and 10 per cent social agencies or marriage
clinics. 

An even more important aspect of the present focus on positive mental 
health is the recognition that mental health means more than freedom from 
emotional difficulties. Several attempts have been made to define mental health 
in terms of active attainment of important goals rather than mere adjustment 
(see, e.g., 34, 37, 46, 68, 73, 75). In this connection, it should be noted that
any definition of either mental illness or mental health implies value judg- 
ments (see 41, 66, 75). With regard to the negative end of the scale there is 
considerable agreement, at least within a given culture. Thus most persons 
in contemporary Western civilization would agree on the maladaptive and 
undesirable nature of such behavior deviations as violent acts of aggression, 
hallucinations and delusions, or the sort of intellectual disorganization that
makes it impossible to attend school or hold a job. With regard to the positive 
end of the scale, however, values differ more widely among groups and among 
individuals. 

Definitions of mental health inevitably reflect these differences in values, 
although some may evade the problem by their vagueness. Mental health has 
been variously described in such terms as creativity, spontaneity in interper- 
sonal relations, integrated personality, correct perception of oneself and of 
one's environment, autonomy, self-actualization and continued growth, rich- 
ness of experience, and active mastery of environment. These are undoubtedly 
worthy values, but others could easily be added to the list. In specific situa- 
tions, moreover, some values may conflict with others, thus requiring judg- 
ments of relative strength. Nondirective, or client-centered, therapists have 
come closest to defining mental health without reference to a specific value 
system (62). Essentially, their object is to free the individual from debilitat- 
ing tensions and anxieties so that he may more effectively pursue his own 
goals. One cannot escape the fact, however, that even this goal of individual 
freedom represents a prior value judgment on the part of the therapist. More- 
over, the individual's own values reflect the earlier influences to which he 
was exposed in home, church, school, and community.

We need to recognize that mental health may be attained in very different 
ways by different persons. And we need to recognize that the value systems 
presupposed by these varying concepts of mental health lie outside the domain 
of science. To be sure, scientific research can help in clarifying and in imple- 
menting values. It can, for instance, reveal cultural differences in value sys- 
tems and the variables associated with such differences. It can also help to 
make explicit the values underlying an individual's behavior. This knowledge 
should increase the individual's ability to make his choices deliberately rather 
than by default.

Comprehensive Approach. It is apparent that both the prevention of
emotional disorders and the fostering of positive mental health as now envis- 
aged require a broad, comprehensive program. Among the fields encompassed 
by such a program are family life, education, and employee relations. Train-
ing programs for marriage and parenthood provide an opportunity for im-
proving not only child-rearing practices but also all interpersonal relations
within the family. The schools can contribute much to mental health through 
both curricular and extracurricular activities, interpersonal relations of the 
child with his peers, and teacher-pupil relations. Much of what was said in 
Chapter 6 regarding employee relations, supervisory practices, and organiza- 
tional structure is obviously relevant to the promotion of mental health. Some 
exponents of the mental health movement think in even broader terms, to 
include community development and the application of mental health prin- 
ciples to large-scale social planning. Crime, delinquency, and even war, they 
argue, are in part problems of mental health. We can certainly grant that 
such conditions represent malfunctions in interpersonal systems, to borrow a 
concept from engineering psychology. 

What roles can the psychologist play in this far-flung program? Thoughtful 
analyses of the psychologist’s unique pattern of qualifications suggest that he 
can serve best through research and consultation (2, 24, 35, 63). Research, 
both basic and applied, is urgently needed in the area of mental health. 
Although much is being done in the effort to alleviate emotional disturbance 
and to promote positive mental health, there is little objective evidence regard- 
ing the effectiveness of any specific procedures. 

With regard to service functions, the professionally trained psychological 
consultant, working through practitioners in other fields with a minimum of 
mental health training, can greatly extend his sphere of operations. There is 
a vast discrepancy between the need for mental health services and the trained 
manpower available to meet this need (1). Consequently, direct work with 
single clients, as in individual psychotherapy, can reach only a very small 
proportion of persons requiring psychological services. Even apart from this 
personnel shortage, however, it is likely that psychologists will function in-
creasingly as mental health consultants in the future. The growing emphasis
on prevention and on positive mental health would seem to point in this
direction.


SUMMARY 


The many kinds of available therapies for treating emotional disorders may 
be grouped under somatic therapies, environmental therapies, psychotherapy, 
and preventive mental health procedures. Somatic therapies, requiring the 
services of a licensed physician, are illustrated by psychosurgery, shock thera-
pies, and various applications of psychopharmacology. The introduction of
tranquilizers has led to a spectacular rise in the recovery rate of American
state mental hospital patients. A major factor in these results appears to be
the role of tranquilizers in making patients more amenable to psychotherapy
and indirectly improving staff morale. Other, more direct effects of such drugs
have not yet been conclusively established in adequately controlled experi-
ments. Environmental therapy includes the therapeutic manipulation of en-
vironmental factors, both in daily life (such as home, school, job, and recrea-
tional contexts) and in institutions. A growing emphasis on milieu therapy
is rapidly transforming mental hospitals from custodial institutions to thera-
peutic communities.


Psychotherapy is practiced in many forms, reflecting the varied theoretical
orientations of the therapists. In the most general terms, it can be described
as the use of verbal or other means of communication in a nonthreatening
interpersonal situation which facilitates the alleviation of anxiety and the
elimination of maladaptive behavior through a learning process. Two widely
used types of psychotherapy are psychoanalysis and client-centered therapy. 
Group therapy is being employed increasingly, either in place of individual 
psychotherapy or in conjunction with it. Psychodrama and role playing are 
special variants of group therapy. Play therapy is commonly utilized with 
children, in lieu of the predominantly verbal techniques employed with adults. 
The current mental health movement is concerned not only with the treatment 
and rehabilitation of institutionalized mental patients but also with preventive 
measures and with the fostering of positive mental health through compre-
hensive social and educational programs.


Clinical Research


In the preceding chapter, attention was called to the pressing need for 
research in the field of mental health. This need stems partly from the fact 
that few techniques currently in use for diagnostic or therapeutic purposes 
have been adequately validated. In addition, it is hoped that basic research 
will open up new approaches to problems of mental health. The Joint Com- 
mission on Mental Illness and Health emphasized the importance of basic 
research in mental health programs (52, p. viii). The same emphasis is to
be found in the reports of special conferences dealing with the role of psychol- 
ogists in mental health (111) and the training and functions of psychologists 
in general (37). Moreover, it is in research that the clinical psychologist, as 
contrasted to other mental health professionals, can make his most distinctive 
contribution (37, 111). Even in his service functions, the clinical psychologist 
with a research orientation can operate more effectively than one who is 
trained only as a practitioner. With research training, he is better able to 
evaluate innovations and to incorporate new developments into his practice. 
In a field that is rapidly evolving, rigid commitment to existing techniques 
can only retard progress. 

Before World War II, the gap between clinical and experimental psychol- 
ogy was very wide. The rapid growth of clinical psychology since that time 
has been accompanied by some rapprochement between the two fields. The 
gap is still evident, but it has narrowed. To be sure, psychologists differ in
their reaction to this gap. Some deplore its existence and urge further narrow- 
ing. Others insist it is inevitable and argue for a realistic acceptance of two 
kinds of psychologists. Nevertheless there are promising signs that psychol- 
ogists can function effectively in the dual role of scientist-practitioner. Clini- 
cians, on the one hand, have become more critical about their techniques and 
more willing to test their hypotheses. Experimentalists, on the other, have 
been doing more and more research on problems arising out of clinical con- 
texts. 

A particularly clear sign of this rapprochement can be seen in the increas- 
ing interest in the development or identification of statistical techniques suit- 
able for clinical data (13, chs. 7 and 8; 14; 46; 110; 121). Several of these 
techniques are designed for the intensive study of single cases. Others make 
it possible to handle several variables simultaneously, a situation likely to occur 
in naturalistic clinical settings. Procedures for investigating interaction effects 
among different variables, for measuring complex nonlinear relations, and for 
analyzing patterns of responses are further examples. The growth of non- 
parametric or distribution-free statistics is also a boon to the clinical researcher, 
whose data often fail to meet the assumptions underlying traditional statistical 
formulas. A related development is to be found in the increasing number of
publications dealing with the experimental designs required in clinical studies
(e.g., 21, 120).

In the preceding chapters, some research findings were cited in connection 
with diagnostic and therapeutic procedures. In the present chapter, however, 
the focus will be on methodological problems and experimental design in 
clinical research. Because of the wide scope of clinical research and the variety 
of problems it encompasses, this chapter can do no more than sample the types 
of problems investigated and present one or two illustrative studies of each
type.


VALIDATION OF DIAGNOSTIC TECHNIQUES 


A major area of applied research in clinical psychology pertains to the 
validation of tests and other diagnostic techniques. Because of the extensive 
use of projective techniques in clinical practice and the many unverified 
claims made about them, these techniques offer many opportunities for re- 
search. Since the validation of these techniques also presents many method- 
ological pitfalls, it provides good illustrative material for the purposes of the 
present chapter (see 4, ch. 20). 

Inadequate experimental designs or inappropriate statistical analyses may 
have the effect of producing spurious evidence of validity where none exists. 
An example is the contamination of either criterion or test data. Thus the 
criterion judges may have access to the subjects' test protocols (see Chapter
2). Conversely, the examiner may have obtained cues about the subject's 
behavior from remarks made by the subject while taking the test, or from 
case-history material or other non-test sources. The best control for the latter 
type of contamination in validation studies is provided by blind analysis, in 
which the test record is interpreted by a scorer who has had no contact with 
the subject and who has no information about him other than that contained 
in the test protocol. 

A more subtle source of error is illustrated by stereotype accuracy. Certain 
descriptive statements, such as might occur in a Rorschach protocol, may 
apply widely to persons in general, or to young men, or to hospitalized mental 
patients, or to whatever category of subjects is sampled by the particular in- 
vestigation. Agreement between criterion and test data with regard to such 
statements would therefore yield a spurious impression of validity. Some con-
trol for this error is needed, such as a measure of congruence between the
test evaluation of one subject and the criterion evaluation of another, control 
subject in the same category (see, e.g., 104, p. 9). The spurious validity arising 
from stereotype accuracy will be recognized as somewhat similar to the 
“Barnum effect” described in Chapter 2.
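
The control just described can be sketched in a few lines of Python. The
sketch is illustrative only: the data are hypothetical, the function names
are our own, and a simple proportion of agreement stands in for whatever
measure of congruence a particular study might adopt. Each evaluation is
coded as a list of descriptive statements marked 1 (judged true of the
subject) or 0.

```python
from itertools import permutations


def agreement(a, b):
    """Proportion of statements on which two evaluations agree."""
    return sum(x == y for x, y in zip(a, b)) / len(a)


def matched_vs_mismatched(test_evals, criterion_evals):
    """Mean agreement for same-subject pairs and for different-subject pairs."""
    n = len(test_evals)
    matched = [agreement(test_evals[i], criterion_evals[i]) for i in range(n)]
    mismatched = [agreement(test_evals[i], criterion_evals[j])
                  for i, j in permutations(range(n), 2)]
    return sum(matched) / len(matched), sum(mismatched) / len(mismatched)


# Hypothetical data for three subjects in the same category.
# The first three statements apply to nearly everyone in that category.
test_evals = [
    [1, 1, 1, 0, 1],
    [1, 1, 1, 1, 0],
    [1, 1, 1, 0, 0],
]
criterion_evals = [
    [1, 1, 1, 1, 0],
    [1, 1, 1, 0, 1],
    [1, 1, 1, 1, 1],
]

matched, mismatched = matched_vs_mismatched(test_evals, criterion_evals)
print(f"matched agreement:    {matched:.2f}")
print(f"mismatched agreement: {mismatched:.2f}")
```

In these made-up data the agreement between test and criterion for the same
subject looks respectable, yet it is no higher than the agreement obtained
when one subject's test evaluation is paired with another subject's criterion
evaluation. Whatever congruence appears is attributable to statements that
hold for the whole category.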

From a different angle, an individual's responses may depend more upon 
the specific testing technique employed than upon his general behavioral 
characteristics. Insofar as behavioral descriptions based upon a single testing 
technique have validity, they should exhibit stability across techniques. To 
check on this aspect of validity, Campbell and Fiske (11) proposed a 
multitrait-multimethod validation process whereby the correlations of the same 
traits measured by different methods are compared with the correlations of
different traits measured by both the same and different methods. Common
variance due to method might result from response sets and other test-taking 
attitudes that would increase the uniformity of the subject's performance on 
a particular test or type of test. Unless the trait correlations are appreciably 
higher than the method correlations, there is obviously no point in describing 
an individual in terms of the given trait categories. For example, if aggressive- 
ness and obsessiveness as measured by Test A correlate higher than do measures 
of aggressiveness derived from Tests A and B, there is some question about the 
value of the construct "aggressiveness" as measured by these two tests. 
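
The comparison in this example can be illustrated with a minimal Python
sketch. It is not the full Campbell-Fiske matrix, only the single comparison
described above, and the scores, variable names, and the helper function
pearson are all hypothetical.

```python
from math import sqrt


def pearson(x, y):
    """Pearson product-moment correlation between two equal-length lists."""
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    sxy = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sxx = sum((a - mx) ** 2 for a in x)
    syy = sum((b - my) ** 2 for b in y)
    return sxy / sqrt(sxx * syy)


# Hypothetical scores for six subjects.
aggr_A = [3, 5, 2, 6, 4, 7]   # aggressiveness, Test A
obs_A = [4, 5, 2, 7, 3, 6]    # obsessiveness, Test A
aggr_B = [5, 3, 4, 4, 6, 5]   # aggressiveness, Test B

# Same trait measured by different methods.
same_trait_diff_method = pearson(aggr_A, aggr_B)
# Different traits measured by the same method.
diff_trait_same_method = pearson(aggr_A, obs_A)

# If method variance outweighs trait variance, the construct is in doubt.
trait_is_questionable = diff_trait_same_method >= same_trait_diff_method
print(same_trait_diff_method, diff_trait_same_method, trait_is_questionable)
```

With these invented scores the two traits measured by Test A correlate much
more highly than the two measures of aggressiveness, which is the pattern
that would call the trait category into question.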

Meehl (68) describes an adaptation of the Campbell-Fiske procedure in- 
volving essentially the correlation between Q sorts of two persons evaluated 
on the same test (e.g., the Rorschach) and the correlation between Q sorts
of the same person evaluated on two different tests (e.g., Rorschach and
TAT). The first correlation would show the degree of similarity between 
the trait patterns of two different persons and would reflect stereotypes and 
sorting bias arising from the test and the scorers. The second correlation would 
show the degree of similarity between the trait patterns of the same person 
as measured by two different tests. The latter could be scored by the same or 
different scorers. 

A common source of spurious validity is failure to cross-validate (see Chap- 
ter 2). Because of the large number of possible diagnostic signs that can be 
derived from most projective techniques, it is likely that by chance alone 
some of them will discriminate significantly between criterion groups. It 
should be recalled that by chance approximately 5 per cent of the diagnostic 
signs investigated ought to discriminate at the 5 per cent level of significance, 
1 per cent at the 1 per cent level, and so on. The validity of a scoring key 
based on such diagnostic signs, however, would collapse to zero when the 
test is cross-validated in a new sample. 
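
The arithmetic behind this warning can be made explicit with a short Python
sketch. The number of signs is hypothetical and the function names are our
own. The second function further assumes that the signs are statistically
independent of one another, which will rarely be strictly true of signs
derived from a single protocol.

```python
def expected_chance_hits(n_signs, alpha):
    """Expected number of signs reaching significance by chance alone."""
    return n_signs * alpha


def prob_at_least_one_hit(n_signs, alpha):
    """Chance that at least one sign is 'significant', assuming independence."""
    return 1 - (1 - alpha) ** n_signs


n_signs = 100  # hypothetical number of diagnostic signs examined

hits_05 = expected_chance_hits(n_signs, 0.05)
hits_01 = expected_chance_hits(n_signs, 0.01)
p_any_05 = prob_at_least_one_hit(n_signs, 0.05)

print(f"expected chance hits at the 5 per cent level: {hits_05:.1f}")
print(f"expected chance hits at the 1 per cent level: {hits_01:.1f}")
print(f"probability of at least one hit at 5 per cent: {p_any_05:.3f}")
```

Under these assumptions an investigator who examines a hundred signs should
expect several apparently discriminating signs even when none has any real
validity, which is why a key built from them must be tried on a new sample.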

The traditional interpretations of many projective test responses are ex- 
pressed in the form of “dispositional constructs” (98, p. 229), such as “un- 
conscious feelings of inferiority.” Such interpretations cannot be fully validated 
unless reformulated in terms of observable behavior as, for example, “will 
buy a more expensive car than he can afford” or “will drop names of impor- 
tant persons” (98, p. 229). Without these behavioral predictions, we can 
determine only whether two independent observers or scorers arrive at the 
same dispositional construct by applying different diagnostic procedures to 
the same individual. Agreement under these conditions may be looked upon 
as a first step toward validation. But such validation must be regarded as 
tentative, because the agreement may simply reflect a common theoretical 
bias on the part of the two observers, a bias that is broad enough to encompass 
different diagnostic techniques. 

Inadequacies of experimental design may also have the effect of under- 
estimating the validity of a diagnostic instrument. One example is provided by 
the use of an inadequate criterion. In Chapter 13 we noted various objections 
to the application of such traditional psychiatric categories as schizophrenia 
and anxiety hysteria. If this type of diagnostic category is used as the sole 
criterion for checking the validity of a personality test, negative results are 
inconclusive. A lack of correspondence in this case might indicate the weak-
ness of the criterion rather than the invalidity of the predictor. Another im-
portant question pertains to the skill, training, and experience of the clinician
who interprets the test protocol. Because of the active part played by the 
clinician in this interpretation, test validity cannot be evaluated independently 
of certain clinician variables. 

Insignificant validity coefficients may also result from a failure to take 
interaction effects into account. For example, the hypothesized relation be- 
tween aggression in fantasy (as revealed, for instance, in TAT stories) and 
aggression in overt behavior may be too complex to be tested by a single 
validity coefficient. Depending upon other concomitant personality character- 
istics, high aggression in fantasy may be associated with either high or low overt 
aggression. There is some evidence to suggest that, if strong aggressive ten-
dencies are accompanied by high anxiety or fear of punishment, expressions
of aggressions will tend to be high in fantasy and low in overt behavior. 
When anxiety and fear of punishment are low, however, high fantasy aggres- 
sion is associated with high overt aggression (83, 87). 

A lack of significant correlation between expressions of aggression in TAT 
stories and in overt behavior in a random sample of cases would thus be 
consistent with the stated hypothesis, since the relation is positive in some 
individuals and negative in others. Obviously, however, such a lack of cor- 
relation is also consistent with the hypothesis that the TAT has no validity 
in detecting aggressive tendencies. A similar methodological problem is illus- 
trated by hypotheses of defensiveness, which maintain that in the more highly 
defensive individuals overt behavior should be negatively correlated with 
strength of emotional needs, while in the less defensive individuals overt be- 
havior and emotional needs should be positively correlated (116). What these 
problems require, of course, is more complex experimental designs that permit 
the measurement of interaction effects through an analysis of results in pre- 
viously identified subgroups (83, 87, 116). 
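
How an interaction can conceal a real relation is easily shown with a small
Python sketch. The scores below are invented for illustration, the two
subgroups are assumed to have been identified beforehand by an independent
measure of anxiety, and the helper function pearson is our own.

```python
from math import sqrt


def pearson(x, y):
    """Pearson product-moment correlation between two equal-length lists."""
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    sxy = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sxx = sum((a - mx) ** 2 for a in x)
    syy = sum((b - my) ** 2 for b in y)
    return sxy / sqrt(sxx * syy)


# Hypothetical scores: fantasy aggression and overt aggression.
low_anx_fantasy = [1, 2, 3, 4, 5]
low_anx_overt = [1, 3, 2, 4, 5]     # rises with fantasy aggression
high_anx_fantasy = [1, 2, 3, 4, 5]
high_anx_overt = [5, 3, 4, 2, 1]    # falls as fantasy aggression rises

r_low = pearson(low_anx_fantasy, low_anx_overt)
r_high = pearson(high_anx_fantasy, high_anx_overt)

# Pooling the subgroups, as a single validity coefficient would do.
r_pooled = pearson(low_anx_fantasy + high_anx_fantasy,
                   low_anx_overt + high_anx_overt)

print(f"low-anxiety subgroup:  r = {r_low:.2f}")
print(f"high-anxiety subgroup: r = {r_high:.2f}")
print(f"pooled sample:         r = {r_pooled:.2f}")
```

In the pooled sample the positive and negative relations cancel, so that a
single coefficient suggests no validity at all, while the separate subgroup
analysis recovers a strong relation in each direction.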

Still another condition that may lower the validity indices obtained with 
projective techniques pertains to the unevenness of coverage of such proce- 
dures. Because of their unstructured nature, projective techniques may provide 
different kinds of information regarding different individuals—or even regard-
ing the same individual when evaluated by different scorers (16, p. 129; 44,
p. 22; 68, p. 21). One person's TAT responses, for example, may reveal a
good deal about his aggression and little or nothing about his achievement 
drive; another person's TAT record may permit a thorough assessment of 
the strength of his achievement drive, while indicating little about his aggres- 
sion. This lack of uniformity in the kinds of information yielded in individual 
cases may help to explain the low validities found when projective test re-
sponses are analyzed for a single trait across a group of persons. Similarly, Q
sorts force the scorer to evaluate each subject in all traits, a requirement that 
is clinically unrealistic (68, p. 21). 

Much of the published literature on the evaluation of projective tests is 
inconclusive because of one or more of the methodological difficulties men- 
tioned above or because of the use of inappropriate statistical techniques (4, 
ch. 26; 15; 25). On the other hand, a few recent studies reflect the rapproche- 
ment of clinical and experimental psychology in the sophistication of their 
experimental designs (see, e.g., 44, 62, 104, 116). 

For purposes of illustration, we may consider an investigation of four per-
sonality tests conducted by Little and Shneidman (62). First, a single exam-
iner administered the Rorschach, TAT, MAPS, and MMPI¹ to each of 12
adult male patients in a Veterans Administration hospital. The 12 subjects 
included three in each of the following diagnostic categories: neurotic, psy- 
chotic, psychosomatic, and normal (fracture and surgery cases with no record 
of emotional disorders). A detailed case history, or anamnesis, for each subject 
was also obtained independently through a series of intensive interviews. 

Both test and anamnestic data were independently analyzed by two groups 
of judges, designated as test judges (TJ) and anamnesis judges (AJ). The
former comprised 48 clinical psychologists, selected from lists of recognized 
experts on each of the four tests employed; the latter included 23 psychiatrists 
and one psychologist. Each TJ performed a blind analysis for four subjects 
on the test for which he was a skilled interpreter. Each AJ prepared a blind 
anamnestic evaluation for two subjects. The only additional information 
available to either test or anamnesis judges was that the subjects were all 
white, Protestant, male veterans between the ages of 22 and 33 years and with 
high school education. Specifically, both test and anamnesis judges performed 
the following five tasks: (1) diagnostic labeling of the subject; (2) rating 
subject's degree of maladjustment on a linear scale; (3) completing a 76-
item Q sort on the subject; (4) answering 117 true-false items about the
subject, typical of statements made in psychological test reports, and indicating 
degree of confidence in each answer; and (5) answering 100 true-false items 
of a relatively factual nature about the subject's past and present life and 
reporting degree of confidence in each answer. To provide a measure of 
scorer reliability, each TJ repeated the entire analysis of one case after a 
lapse of 10 days.

The results were analyzed statistically to measure congruencies among the 
personality evaluations of each subject as determined by the different tech- 
niques and different judges. With regard to diagnostic labels, there was little 
correspondence between the categories assigned by the test judges and the 
subject's previously established diagnosis. The test judges also disagreed widely 
among themselves. There was a strong tendency, moreover, to classify the 
normal subjects as neurotic or psychotic and to rate them as maladjusted. This 
tendency probably reflects in part the strongly pathological orientation in the 
experience of the clinicians and possibly in the context in which the tests were 
developed. 

Of particular interest are the results of the more detailed analyses of Q 
sorts and other personality descriptions. Although somewhat higher than 
chance, the degree of agreement among the different evaluations was quite 
low. Not only were congruencies between anamnestic and test results low, but 
little agreement was also found from one test to another and even among 
different judges using the same test. The 10-day reliability check showed that
individual judges agreed closely with their former evaluations with regard to
diagnostic label and degree of maladjustment, but not with regard to detailed 
personality descriptions. All in all, the findings of this study raise considerable 
doubt about the clinical value of the instruments investigated. On the other 
hand, it should be noted that this investigation does not meet all the method-
ological objections raised against clinical validation studies. For example, the
Q sorts and true-false statements may have imposed unrealistic and unduly
demanding conditions upon the test interpreters, by requiring them to evaluate
all traits for all individuals.

1 The MMPI is a self-report personality inventory, described in Chapter 13; the
Rorschach, TAT, and MAPS are projective techniques, the first two of which were
also described in Chapter 13. In the MAPS (Make A Picture Story) technique, the
subject selects cardboard figures and a backdrop, which he assembles on a miniature
stage, and then tells a story about them.


STUDIES OF THE PROCESS OF PSYCHOTHERAPY 


Research on the process of psychotherapy has been concerned with such 
questions as the effect of therapist variables upon the psychotherapeutic 
process, the specific ways in which therapists function, and the changes in 
client behavior that occur in the course of psychotherapy. In contrast to studies 
of the outcome of psychotherapy, to be discussed in the next section, this 
research is limited to what goes on during the psychotherapeutic sessions 
themselves. Most of this “process” research has been conducted by client- 
centered psychotherapists (see 90; 95, pp. 96-107), although the same 
procedures have been applied to a more limited extent to psychoanalytic and 
other forms of psychotherapy (6, 18, 29, 31, 80, 101, 112, 113). 

Changes in the Course of Therapy. Process research has been greatly 
advanced by the application of two techniques, namely, the recording of 
psychotherapeutic sessions and content analysis. With the client’s permission, 
therapeutic sessions have been recorded by phonographic, wire, or tape re- 
corders and then transcribed. When these records are employed for either 
research or teaching purposes, of course, all identifications of individual clients 
are eliminated. Content analysis is a technique for classifying the meaningful 
content (as opposed to grammatical and other formal properties) of the verbal 
responses of client and therapist. Several schemas of classification have been 
worked out for this purpose, the specific categories varying with the objectives 
of the study and the theoretical framework of the investigator (6, 18, 20, 80, 
106). 

A typical study conducted with client-centered psychotherapy will serve to 
illustrate the procedure. Seeman (100) analyzed over 6,500 client and therapist 
statements taken from 60 therapeutic sessions with 10 clients. Using a previously 
developed schema (106), he classified client responses into four categories: 
(1) expressions of problems or symptoms, (2) simple acceptances of 
therapist response, (3) indications of understanding or insight, and (4) dis- 
cussions or explorations of plans for the future. The frequency of each of the 
four types of response was determined separately for successive fifths of the 
psychotherapeutic process in order to reveal any changes that occurred during 
therapy. The results can be seen in Figure 102. In the course of psychotherapy, 
the percentage of statements dealing with problems or symptoms declined 
sharply, while the percentage of those showing understanding or insight rose. 
Discussions of future plans also increased during the later sessions. Simple 
acceptance of therapist response rose in the early part of therapy and then 
declined slowly. 

As a further index of the effectiveness of psychotherapy, Seeman computed 
the ratio between number of statements showing understanding or insight 
and the sum of these statements and those expressing problems or symptoms. 
When the data from all clients were combined, this ratio rose from the first to 
the last fifth of the psychotherapeutic process. Moreover, the rise tended to be 
greater for those clients whom the therapists rated as most improved than for 
Fig. 102 Changes in client responses during client-centered psychotherapy. 
(Data from Seeman, 100, p. 161.) 


those rated as least improved. The latter finding, however, may be a methodo- 
logical artifact, since the therapist's rating of client improvement was probably 
influenced by client statements in the course of therapy. Another finding of 
the same study was that attitudes expressed by clients in the present tense 
tended to become more positive and less negative in the course of therapy, 
while attitudes expressed in the past tense showed the reverse change. Ap- 
parently the past was viewed with less favor and the present with increasing 
optimism following therapy. 
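
Seeman's index can be written out in a few lines. The sketch below, in Python, computes the ratio of insight statements to the sum of insight and problem statements for each successive fifth of therapy. The counts are invented for illustration and are not Seeman's data; the function name `insight_ratio` is likewise an assumption of this sketch.

```python
def insight_ratio(insight: int, problems: int) -> float:
    """Insight statements as a share of insight plus problem statements."""
    total = insight + problems
    if total == 0:
        raise ValueError("no insight or problem statements in this segment")
    return insight / total


# Hypothetical statement counts for successive fifths of therapy
# (illustrative only; not taken from the study described above).
insight_counts = [4, 9, 15, 22, 30]
problem_counts = [52, 44, 35, 27, 20]

ratios = [insight_ratio(i, p) for i, p in zip(insight_counts, problem_counts)]

for fifth, ratio in enumerate(ratios, start=1):
    print(f"fifth {fifth}: ratio = {ratio:.2f}")
```

A ratio that rises across the five segments corresponds to the pattern reported for the combined client data. The same caution applies to the sketch as to the original index: it summarizes only what is said during the sessions.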

Several other progressive changes have been identified in the course of 
psychotherapy. Defensiveness tends to decrease as therapy proceeds (39, 79). 
Similarly, expressions of hostility increase as the anxiety associated with such 
feelings is reduced through therapy (79). Such findings suggest an increasing 
ability to recognize and express feelings freely in the course of therapy. 

A number of studies by psychologists of the client-centered school have been 
concerned with changes in the client's self concept. With therapy, the self 
concept tends to become more favorable and comes to resemble more closely 
the individual's self-ideal concept, or the way he would like to be. These 
changes have been investigated by having the client perform a Q sort with 
reference to his real self as he perceives it and his ideal self, at different stages 
of therapy. In one study of 25 cases (10), the mean correlation between self 
and ideal sorts before therapy was -.01, as contrasted with a mean correlation 
of .58 found with normal control subjects. After therapy, the mean client 
correlation rose to .38. In a follow-up after an interval of six months to a year, 
this mean showed a negligible drop to .31. The mean control correlation 
remained virtually unchanged over the time intervals covered. 
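
A self-ideal correlation of this kind is typically a product-moment correlation computed across the Q-sort statements, each statement being scored by the pile in which the client placed it. A minimal sketch in Python follows, assuming a nine-statement sort with five piles. The placements are hypothetical, and the helper `pearson_r` is written out for illustration rather than taken from any of the studies cited.

```python
from math import sqrt


def pearson_r(x, y):
    """Product-moment correlation between two equally long score lists."""
    n = len(x)
    if n != len(y) or n < 2:
        raise ValueError("need two lists of equal length with at least 2 items")
    mean_x = sum(x) / n
    mean_y = sum(y) / n
    sxy = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    sxx = sum((a - mean_x) ** 2 for a in x)
    syy = sum((b - mean_y) ** 2 for b in y)
    if sxx == 0 or syy == 0:
        raise ValueError("scores must vary for the correlation to be defined")
    return sxy / sqrt(sxx * syy)


# Hypothetical pile numbers (1 = least like me, 5 = most like me) for the
# same nine statements; every sort uses the same forced distribution.
ideal_sort = [5, 4, 3, 4, 2, 3, 2, 3, 1]
self_before = [1, 2, 2, 3, 3, 3, 4, 4, 5]
self_after = [4, 3, 3, 4, 2, 3, 2, 5, 1]

r_before = pearson_r(self_before, ideal_sort)
r_after = pearson_r(self_after, ideal_sort)
print(f"self-ideal r before: {r_before:.2f}, after: {r_after:.2f}")
```

In a group study, one such coefficient would be obtained for each client at each stage, and the coefficients would then be averaged to give mean correlations like those quoted above.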

Although the observed changes in self-acceptance in the course of psychotherapy 
are consistent with the underlying rationale of client-centered therapy, 
their interpretation is not so simple as might seem at first sight (see 17). One 
difficulty arises from the fact that different indices of self-acceptance do not 
correlate very highly with each other and hence may be measuring different 
constructs. Much more information is needed on the generality of self- 
acceptance with reference to measuring techniques, traits or items, situations, 
and time. Another difficulty is that the meaning of high self-acceptance scores 
is ambiguous. In such scores, self-acceptance may be confounded with 
defensiveness, lack of insight into one’s emotional difficulties, or the tendency to 
choose socially desirable answers on personality inventories. 

Therapist Variables. In the effort to learn more about the therapeutic 
relationship, some investigators have studied the effects of therapist 
characteristics and therapist responses upon client behavior during therapy. For 
example, certain characteristics of the therapist’s responses, such as the accuracy 
with which he reflects client feelings or the degree of his unconditional positive 
acceptance of the client, have proved to be significantly related to the extent 
of intrapersonal exploration that the client undertakes during group therapy 
(114). Some studies on individual psychotherapy have found that “insight” 
and “discussion of plans” by the client are more likely to follow nondirective 
than directive therapist responses and that resistance to the therapist and to 
the therapeutic process more often follows directive responses (95, pp. 247-259; 
106). 

Of particular interest is the evidence of interaction among client, therapist, 
and process variables (95, pp. 247-259; 107). These findings suggest that, 
because of their individual personality characteristics, some therapists may 
function more effectively with one type of therapy, while others may function 
more effectively with a different type. Similarly, pre-therapy personality 
characteristics of clients affect their responses to different therapists and to different 
therapeutic processes. The importance of compatibility between therapist and 
client is indicated by this research. There is also evidence of a close cor- 
respondence between client's feelings toward therapist and therapist's feelings 
toward client. In an intensive study of the psychotherapeutic relationship 
between one therapist and 20 clients (107), both therapist and clients filled out 
specially constructed "affect scales" pertaining to their feelings toward each 
other, following each therapeutic session. For all sessions combined, the 
correlation between client and therapist affect for the 20 cases proved to be .70. 

Still another approach is illustrated by an investigation of the effect of 
therapist response upon the frequency of client responses in different content 
categories (80). This study is of particular interest because it demonstrates 
that even in nondirective therapy the subtle approval or disapproval implicit 
in the therapist's remarks influences the subsequent frequency of various kinds 
of client responses. Among the response categories that were consistently 
approved by the therapist were expressions of client independence; among 
those that were disapproved were intellectual defenses and feelings of anxiety 
about independence. A comparison of the pooled categories of approved 
responses with the pooled categories of disapproved responses over eight hours of 
psychotherapy yielded the results shown in Figure 103. It can be seen that the 
proportion of approved responses rose sharply, while the proportion of 
disapproved responses dropped sharply, in the course of therapy. That subtle 
verbal reinforcement by the therapist will significantly affect the subsequent 
course of the client's responses has been repeatedly confirmed by other 
investigators (e.g., 1; 88; 91; 96; 97; 113, pp. 61-94). 

Physiological Measures. The research considered thus far has dealt ex- 
clusively with verbal responses. Considerable information is also available re- 
garding changes in physiological indices of anxiety during psychotherapy. Both 
laboratory and clinical studies provide extensive evidence showing that such 
physiological measures as galvanic skin reaction, heart rate, blood pressure, 
muscle tension, and gastric motility can serve as indicators of emotional 
conflict areas during therapeutic sessions, can indicate quantitatively the extent 
of emotional involvement, and can reflect changes in client behavior as therapy 
progresses (78, chs. 18-19; 95, pp. 160-208). Such physiological changes can 
be detected in moment-by-moment comparisons within a single therapy session, 
as well as in the comparison of different sessions. 

In view of the foregoing findings, it would seem that physical indices of 
anxiety should provide a good measure of the effectiveness of therapy. Several 
difficulties arise, however, when the indices are employed for this purpose (65; 
95, pp. 160-208). First, the intercorrelations among different anxiety indicators 
are rather low. There is considerable specificity in the autonomic response 
pattern of different persons (57; 95, pp. 160-208). Thus one individual might 
consistently show his maximal response to stress in blood-pressure rise, another 


Fig. 103 Effect of therapist approval or disapproval upon frequency of 
client statements in different categories. (From Murray, 80, p. 11.) 
in heightened muscle tension, and a third in increased finger sweating. It 
would therefore be misleading to employ a single physiological variable for 
all cases. 

A second difficulty stems from the fact that physiological indicators of 
anxiety cannot be taken directly as indices of therapeutic effectiveness but must 
be interpreted in the light of other behavioral data (95, pp. 160-208). For 
instance, physiological measures cannot differentiate between the reduction 
of stress resulting from neurotic defenses and that resulting from the resolution 
of conflicts. Many psychotherapists maintain that anxiety should rise and 
then decline in the course of therapy, as defensive symptom systems are 
broken down and new responses are developed. It is apparent that anxiety 
indicators need to be considered in combination with other information about 
the client and interpreted against previously formulated theoretical 
expectancies. 

A final limitation is common to all measures taken during the therapeutic 
process, whether they be physiological or verbal. Any change observed in the 
course of psychotherapy may be restricted to the therapeutic situation. The 
therapist’s nonthreatening and accepting attitude, for instance, may gradually 
lead to a reduction in the anxiety the client experiences while talking to the 
therapist about emotional problems. It is, of course, an underlying assumption 
of therapeutic practice that these response changes will generalize from talking 
to other behavior and from the therapeutic situation to other situations of daily 
living. On theoretical grounds, there is good reason to expect this generaliza- 
tion to occur. But it must be recognized that process studies as such can 
provide no evidence for such generalization. 


EVALUATING THE OUTCOME OF THERAPY 


A particularly difficult field of clinical research is that concerned with the 
evaluation of therapy. Although hundreds of studies have been published in 
this area, few have yielded conclusive results. Serious methodological problems 
make it difficult to determine the effectiveness of somatic therapies as well as 
psychotherapy. A review of over 500 publications on a leading tranquilizer, for 
example, found only 37 studies that met even minimum standards of scientific 
acceptability (43). Surveys of published studies on psychotherapy with adult 
neurotics (24; 26, ch. 18) and with children (59) failed to find satisfactory 
evidence that psychotherapy facilitates recovery. To be sure, that which is 
unproved is not thereby disproved. When small samples are employed, for 
example, only very large effects can be significantly established. Much larger 
samples than are ordinarily available would be required to demonstrate slight 
treatment effects (32, p. 187). Similarly, unreliable measuring instruments, 
inadequate criteria of improvement, or the unskilled application of a thera- 
peutic technique may artificially reduce the difference in outcome between 
treated and untreated samples. 
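
The dependence of detectable effect size upon sample size can be made concrete with the usual normal-approximation formula for comparing two proportions, in which the number of cases required per group is (z_alpha/2 + z_beta)^2 [p1(1 - p1) + p2(1 - p2)] / (p1 - p2)^2. This is a standard textbook approximation, not a formula given in the sources cited here, and the recovery rates in the sketch below are hypothetical.

```python
from math import ceil
from statistics import NormalDist


def n_per_group(p_control, p_treated, alpha=0.05, power=0.80):
    """Approximate cases needed per group to detect p_treated vs. p_control
    with a two-sided test at level alpha and the stated power."""
    if p_control == p_treated:
        raise ValueError("the two rates must differ")
    standard_normal = NormalDist()
    z_alpha = standard_normal.inv_cdf(1 - alpha / 2)
    z_beta = standard_normal.inv_cdf(power)
    variance = p_control * (1 - p_control) + p_treated * (1 - p_treated)
    return ceil((z_alpha + z_beta) ** 2 * variance / (p_control - p_treated) ** 2)


# Hypothetical recovery rates: 60 per cent among untreated cases,
# compared with a slight and a large treatment effect.
slight_effect = n_per_group(0.60, 0.65)
large_effect = n_per_group(0.60, 0.85)
print(f"slight effect: {slight_effect} per group; large effect: {large_effect} per group")
```

Under these assumptions the slight effect calls for many times as many cases per group as the large one, which is the sense in which a small study that finds no significant difference leaves the question of a slight benefit open.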

Criteria of Outcome. One of the most persistent problems in outcome 
research centers around the criteria of outcome (113, pp. 320-324; 119). How 
can we judge whether or not the individual has improved? One way is to ask 
the client about his own feelings. Such information could be obtained either 
through an informal report or through a standardized self-evaluation, as in a 
personality inventory or Q sort. A difficulty with this approach arises from 
defense mechanisms, which may make the client's self-report totally misleading. 
Another source of error has been described as the “hello-goodbye” effect 
(42). When a client is seeking therapy, he is likely to exaggerate his 
difficulties because of a subtle social obligation to justify his appeal for help; when 
about to terminate therapy, on the other hand, he is likely to overestimate his 
improvement and satisfaction out of courtesy and gratitude toward the therapist. 
For many persons, it may be difficult to say they feel no better after a pro- 
fessionally trained therapist has devoted many hours to helping them. Sugges- 
tion would also operate, insofar as the individual expects to benefit from 
therapy. 

If we rely upon the therapist’s evaluation of the patient's improvement, we 
encounter similar difficulties. Being ego-involved in the therapeutic process, 
the therapist cannot be considered an unbiased observer. Having diligently 
applied a form of therapy in whose efficacy he believes, the therapist cer- 
tainly expects to find improvement. Although not subject to this bias, objective 
measures of verbal and physiological changes during therapy, such as those 
employed in process research, have the limitation that they may be specific 
to the therapeutic situation. There is some evidence to suggest that client 
responses during therapeutic sessions may be unrelated to external indices of 
progress (50). 

A fourth type of outcome criterion is based upon the ward behavior of 
institutionalized patients. Several objective rating scales have been developed 
to evaluate this behavior. One such scale consists of 150 items to be marked 
by nurses in reporting on the patient’s ward behavior during the preceding 
week (9). To what extent improved adjustment to the sheltered institutional 
environment will generalize to behavior outside the institution must of course 
be checked for specific therapies and for different kinds of patients. 

A fifth and final type of outcome index utilizes follow-up data regarding the 
individual's behavior in real-life situations. Such data may range from length 
of time the individual remains outside the hospital to evidence of job and 
family adjustment and associates’ evaluations of interpersonal behavior. Al- 
though theoretically this type of criterion would seem to be the most satis- 
factory, it is difficult to apply because of inequalities in the environments to 
which different persons must adjust. The experimenter has no control over the 
circumstances of the individual's life and little knowledge about them. A 
patient may return to the hospital within six months or may fail to adjust to 
his job, not because of the inadequacy of the therapy, but because of the un- 
usually stressful nature of his environment. 

In view of the many special problems presented by each criterion of out- 
come, it is not surprising to find that the intercorrelations among different 
types of criteria are often low (32). A patient may be greatly improved 
according to one criterion and unimproved according to another. In a well- 
controlled study of the relative effectiveness of three forms of psychotherapy 
with patients in a Veterans Administration hospital, there was little relation 
between outcome criteria based on ward behavior and those based on a six- 
month follow-up after discharge; and there was little relation between these 
criteria of outcome and those based on self-report inventories or Q sorts (27). 
Even among follow-up criteria, moreover, the relative effectiveness of different 
therapies differed depending upon whether a hospitalization or an employment 
criterion was chosen. 

Duration of follow-up also makes a difference in the results. A survey (109) 
of many studies on shock therapies and psychosurgery showed a considerably 
higher recovery and improvement rate among treated than among untreated 
cases immediately following therapy. But the difference declined with time 
and had virtually disappeared within five years (see Figure 104). These 
somatic therapies may thus hasten improvement by patients who would 
eventually recover if untreated. Even this effect, of course, is a notable ac- 
complishment if it can be conclusively established. 

It should also be noted that the choice of a criterion of outcome involves 
some value judgment (22; 95, pp. 1-9). The question of what constitutes 
mental health, discussed in Chapter 14, is also relevant to the evaluation of 
therapy. Therapists differ, for example, in the relative value they place upon 
individual freedom and upon conformity and adjustment as goals of therapy. 
Clients also vary with regard to these values, partly as a function of 
socioeconomic background and other cultural factors. Such value differences 
represent one of the variables in the compatibility of client and therapist that may 
in turn affect the success of psychotherapy. All of the foregoing problems 
indicate the desirability of investigating the outcome of therapy in terms of 
multiple criteria and of specifying fully the criteria employed in each study. 

Selection of Subjects. A second major methodological question pertains 
to the choice of both experimental and control subjects. Because of the likeli- 
hood of interaction between client variables and types of therapy, it is im- 
portant to work with groups or subgroups that are homogeneous with regard 
to relevant variables. Which variables are relevant depends somewhat upon 
the type of therapy under investigation and the objectives of the study. Client 
characteristics that are likely to make a difference can be identified in part 
from earlier empirical findings and in part from the theoretical rationale 
underlying the therapy. Any study, however, should probably control, as a 
bare minimum, such variables as age, sex, socioeconomic level, duration of 
illness, type of onset (sudden or gradual), and diagnostic category (95, pp. 
10-26; 120). The expected improvement and recovery rates among untreated 
cases vary as a function of these patient characteristics. For instance, the base 
rate of improvement is considerably higher for manic-depressives than for 
schizophrenics (32, 109). When feasible, of course, more precise specifications 
regarding symptoms, emotional problems, and adjustment mechanisms are 
desirable, in view of the inadequacies of traditional diagnostic categories. 

In any evaluation of therapy, it is particularly important to use a control 
group matched with the experimental group in all of the previously mentioned 
variables. It is the function of the control group to indicate what improvement 
(or deterioration) would occur in the absence of the specific therapy under 
investigation. Changes occurring in the control group should reflect 
“spontaneous recovery” occurring as a result of nonspecific treatments (such as 
institutionalization) and of the many uncontrolled and unknown events in 
the individual's day-by-day experiences. 

Unless special care is exercised in matching treated and control groups, 
selective bias is likely to influence group assignments. There is a strong 
tendency to place in the control groups cases with poorer prognosis (109). It 
has been demonstrated that the recovery rate of control groups used in studies 
of somatic therapies is significantly lower than that of the general untreated 
mental hospital population prior to the introduction of these specific therapies. 
Figure 104 provides a striking demonstration of this difference, based upon 
a careful analysis of published surveys of schizophrenics that were conducted 
prior and subsequent to the introduction of shock therapies and psychosurgery. 
It can be seen that, in the five-year follow-ups, there is virtually no difference 
between treated and general untreated samples, while the control groups 
employed in the various investigations of these therapies showed a much lower 
recovery rate. 

In experiments with outpatients, control cases are sometimes drawn from 
waiting lists. This procedure presents several difficulties. It is likely that many 
such cases will obtain help elsewhere, especially if the waiting period is long. 
They may in any event lose contact with the clinic and be unavailable for 
follow-ups. The fact that they have not been accepted for therapy may 
heighten the feeling of rejection that so often characterizes their interpersonal 
attitudes. 

The waiting period may also be utilized to provide an own-control for cases 
later accepted for therapy. In a relatively well-designed experiment using this 
procedure, 22 self-referred neurotic clients were compared during equal 
periods of waiting and of nondirective therapy (12). The waiting period 


Fig. 104 Outcome for institutionalized schizophrenics given custodial 
treatment during pre-therapy era and for those used as controls in 
somatic-therapy studies. (Adapted from Staudt and Zubin, 109, p. 185.) 
ranged from 3 to 24 weeks and averaged slightly under 9 weeks. The over-all 
results showed a significant advantage of therapy over waiting period as 
reflected by Q sorts but not by TAT performance. Improvement during therapy 
was greater with experienced than with inexperienced therapists. These find- 
ings need to be qualified by the limitations inherent in outcome indices ob- 
tained during therapy sessions, by any attitudinal effects of the waiting period, 
and by the possible influence of temporal and sequential factors. 

Experimental Design. There are many problems of experimental design 
that must be considered in planning or evaluating any study on the effects of 
therapy (see, e.g., 21; 115, chs. 5 and 6). Some of these problems are common 
to all outcome research in clinical psychology; others are specific to the evalua- 
tion of a particular form of treatment. As examples, we shall consider two 
problems, one pertaining chiefly to psychotherapy and the other to drug 
therapy. 

With regard to psychotherapy, it is difficult to separate the influence of a 
specific therapy from the more general effect of any sort of therapeutic 
attention. The latter underlies “total push” and other milieu therapies. Control 
subjects who receive no discernible form of treatment may deteriorate or fail 
to improve because of feelings of hopelessness and neglect. It is largely for 
this reason that several investigators have advocated—and some used—one 
form of psychotherapy as a control for another. In some studies, different 
therapies are simply evaluated against each other; in others, a minimal 
“control” therapy is introduced, which is less time-consuming and more 
superficial than the other forms of psychotherapy (see, e.g., 27; 95, pp. 10-26). 

In experimental designs calling for multiple therapies, the role of the 
therapist must be considered. If different therapists are employed with each 
type of therapy, differences among therapists will be confounded with 
differences among therapies. If the same therapists are assigned to all therapies in 
some balanced design, the interaction of therapist and type of therapy should 
be investigated. It is likely that each therapist will be most successful with the 
therapy he prefers and in which he is most experienced. In addition, it is 
highly desirable for outcome studies to be so designed as to permit the 
measurement of interactions between client characteristics and type of therapy (see 
21). One form of psychotherapy may be more effective with certain types of 
disorders or certain kinds of persons; another form may be more effective with 
others. 

In the evaluation of drug therapies, control procedures must be such as to 
rule out the placebo effect (63; 103; 115, ch. 6). The term “placebo,” derived 
from a Latin verb meaning “to please” or “to placate,” originally referred to a 
pharmaceutically inactive substance prescribed when no specific remedy was 
available for the patient's condition. More recently, placebos have come to be 
employed in drug experiments to control the effects of suggestion on both 
subjects and observers. The customary procedure is to administer a placebo to 
some subjects and the experimental drug to others in a form that makes the 
two indistinguishable. In the “double-blind” technique, the identity of drug 
and placebo (or control) cases is not revealed to either subjects or observers 
(e.g., therapists, nurses, attendants, examiners). 

Some drugs present a more difficult control problem because they produce 
recognizable side effects, such as dizziness or motor incoordination. It would thus 
be an easy matter for subjects and observers to determine who had received 
the placebo and who had received the drug. Under these circumstances, it is 
desirable to use an "active placebo," which simulates the side effects of the 
drug but does not contain the specific pharmaceutical agent under investigation. 
This problem is not likely to arise in research on the common tranquilizers and 
energizers, unless very large doses are employed. 

The need for stringent control procedures in drug research was vividly 
demonstrated in a simulated drug-therapy study conducted with 120 hospitalized 
psychiatric patients (63). With the exception of the investigators and the 
medical director of the hospital, all staff members and participating patients 
were under the impression that a new tranquilizer and a new energizer were 
being evaluated. Although the two kinds of tablets differed in size, color, and 
taste, both kinds actually contained the same inert placebo. On the basis of 
their specific psychiatric disorders, 60 patients were selected for the 
"tranquilizer" study and 60 for the "energizer" study. Each group was divided into 
matched experimental and control groups, the latter receiving no tablet. The 
"drugs" were administered for a 6-week period, preceded and followed by 
2-week baseline periods with no drug. Weekly ratings on a behavioral rating 
scale were recorded for each patient by the nurses. Patients, psychiatrists, and 
nurses also made a subjective evaluation of the degree to which each patient 
had been helped by the therapy. 

According to the subjective evaluations, from 53 to 80 per cent of the 
patients appeared to have benefited from these "drugs." Such findings are not 
unlike those reported in some equally subjective and uncontrolled studies in 
support of the psychiatric use of certain drugs. In terms of the better controlled 
comparison of behavior ratings of experimental and control groups, the alleged 
tranquilizer produced a statistically significant improvement. The alleged 
energizer failed to do so, probably because energizers were newer and less 
widely accepted than tranquilizers at the time of the study. All that the results 
demonstrate, of course, is the effect of suggestion upon patients, staff, or both. 


RESEARCH ON THE CAUSES OF 
MENTAL DISORDERS 


Investigations of the nature and causes of intellectual and emotional disorders 
constitute a large portion of the basic research underlying clinical psychology. 
The phenomenal growth of clinical psychology since World War II has been 
accompanied by a vigorous and comprehensive attack on problems of etiology. 
This research has drawn upon a multitude of disciplines, from genetics and 
biochemistry to sociology and communication theory. Because the major em- 
phasis in this book is on applied rather than on basic research, etiological 
studies will be sketched rather briefly, in order merely to suggest the varied 
approaches that are currently being explored. 

Especially promising are certain large-scale interdisciplinary projects, such as 
the Schizophrenia and Psychopharmacology Project conducted jointly at the 
University of Michigan and the Ypsilanti State Hospital (35). Believing that 
progress in the understanding and treatment of schizophrenia requires the 
identification of different syndromes that may be loosely grouped under this 
category, the investigators organized the project around the problem of 
classification. For the purposes of this study, a special research ward was established, 
in which diet and other details of patient care were carefully controlled. For 
each experimental patient, as well as for several control groups, data were 
gathered through psychiatric interviews, case histories, records and ratings of 
ward behavior, a specially developed battery of psychological tests, and an 
extensive series of physiological and biochemical measures. Preliminary 
analysis of the intercorrelations among several hundred variables thus measured 
revealed certain clusters that cut across biological, psychological, and clinical 
indices. These clusters differentiated significantly among such diagnostic groups 
as paranoid schizophrenics, other schizophrenics, patients with chronic brain 
syndrome, and other nonschizophrenic psychotics. Longitudinal studies also 
indicated that some of these clusters tended to shift under drug action, thus 
providing further corroboration of the concomitance of certain measures. This 
project also covered many studies of a more specific nature, dealing with 
biochemical, physiological, and psychological factors associated with schizophrenia. 
The findings of these studies have been separately reported in an extensive list 
of publications. 

Still better control of conditions is provided by a long-range research project 
in progress at the National Institute of Mental Health (55). In this study, a 
small and carefully selected group of schizophrenic patients live in an 
experimental ward with controlled diet and other uniform living conditions, while 
a group of normal control subjects live under approximately comparable condi- 
tions in a similar ward. This procedure represents the most far-reaching at- 
tempt as yet undertaken to minimize the operation of extraneous variables in 
comparative studies of schizophrenics and normals. 

Heredity. The extent to which hereditary factors contribute to the etiology 
of mental disorders is still largely unknown.² Since the experimentally
controlled, selective-breeding procedures of animal genetics cannot be applied to 
human subjects, knowledge about human heredity must be derived from 
relatively indirect and uncontrolled sources. Thus human geneticists have 
investigated the extent of consanguinity (marriage of close relatives) in the 
families of persons with a given disorder, the incidence of a disorder in 
relatively isolated inbred communities, and the occurrence of a disorder in 
different members of the same family (26, ch. 8; 51, ch. 1). Because of
insufficient information about the operation of other factors, however, these 
approaches rarely provide conclusive evidence of hereditary influences. 

Data on family resemblances in a given trait or condition are particularly 
difficult to interpret because families represent environmental as well as 
biological units. Even physical disorders may run in families without
necessarily being hereditary. Beriberi shows high concentration in certain families, 
not because of a genetic basis, but because of a vitamin deficiency in the 
family diet. Members of the same family not only share many features of a 
common environment but also interact with each other. Hence there are many 
opportunities for mutual influence. Quite apart from heredity, an illiterate 
mother provides a less stimulating intellectual environment for her children 
than does a mother with a college degree. A more subtle but potentially 
effective factor is social expectancy. Since relatives are expected to resemble 
each other, any chance similarity between them is likely to be noticed and 
magnified. The expected similarities may in turn influence the individual's 
self concept and his subsequent psychological development. 

² For an introduction to the heredity-environment problem in psychology, see 2 (chs. 
3, 4, and 9) and 3. 

It is only when a defect is transmitted according to a simple and easily 
identified hereditary mechanism that data on familial incidence can demon- 
strate its genetic origin. For example, if a characteristic depends upon a single 
recessive gene, its observed familial incidence can be checked against the 
expected distribution derived from Mendelian ratios (see 2, pp. 270-274). 
Such simple hereditary mechanisms have been identified for only a few human 
conditions. Among them are certain rare and extreme forms of mental defi- 
ciency, such as phenylketonuria and the amaurotic idiocies. These types of 
mental deficiency have also been traced to metabolic disorders, which in turn 
stem from biochemically defective genes. Knowledge about the many interven- 
ing physiological steps, from defective gene to behavior disorder, helps to 
clarify the hereditary basis of the disorder. For most psychological disorders, 
however, this type of information is unavailable. 
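
The check of familial incidence against Mendelian ratios can be illustrated with a simple goodness-of-fit computation. The following Python sketch is illustrative only. The offspring counts are hypothetical and are not taken from any study cited here. It assumes that both parents are unaffected carriers of a single recessive gene, so that one-fourth of their offspring are expected to show the condition, and it ignores the sampling bias that arises when families are located through an affected child.

```python
def mendelian_chi_square(affected, unaffected, expected_affected_ratio=0.25):
    """Pearson chi-square statistic comparing observed counts with a Mendelian ratio."""
    total = affected + unaffected
    expected_affected = total * expected_affected_ratio
    expected_unaffected = total - expected_affected
    return ((affected - expected_affected) ** 2 / expected_affected
            + (unaffected - expected_unaffected) ** 2 / expected_unaffected)


# Hypothetical counts of offspring from carrier-by-carrier matings.
observed_affected = 27
observed_unaffected = 73

chi_square = mendelian_chi_square(observed_affected, observed_unaffected)

# Critical value of chi-square at the 5 per cent level with 1 degree of freedom.
CRITICAL_5_PERCENT_1_DF = 3.841
consistent_with_recessive_model = chi_square < CRITICAL_5_PERCENT_1_DF

print(f"chi-square = {chi_square:.3f}")
print("consistent with 3:1 ratio:", consistent_with_recessive_model)
```

A small statistic of this kind shows only that the observed counts do not depart significantly from the expected ratio. It does not by itself rule out environmental explanations of the familial incidence.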

In the effort to rule out some of the uncontrolled factors of familial studies, 
several investigators have worked with identical (monozygous) and fraternal 
(dizygous) twins. Since they develop from a single fertilized ovum, identical 
twins have identical sets of genes. Fraternal twins, on the other hand, are no 
more alike in heredity than ordinary siblings, although they are born at the 
same time. The usual procedure in twin studies is to locate individual cases of 
twins who manifest a particular disorder. The incidence of the disorder among 
the co-twins and other relatives of these "index cases" is then investigated. 
When both members of a pair of related persons (twins, siblings, parent- 
child, etc.) exhibit the same disorder, they are described as "concordant"; 
when only one member shows the disorder, the pair is described as "dis- 
cordant." 
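
The arithmetic behind a concordance rate can be sketched briefly. The Python example below is a minimal illustration using hypothetical pairs. It computes a simple, uncorrected percentage of index cases whose co-twin shows the same disorder, and it does not reproduce the age correction or any other adjustment used in published studies.

```python
def concordance_rate(co_twin_affected):
    """Percentage of index cases whose co-twin shows the same disorder.

    Each entry describes one pair: True if the co-twin of the index case
    is also affected (concordant), False otherwise (discordant).
    """
    if not co_twin_affected:
        raise ValueError("at least one pair is required")
    concordant = sum(1 for affected in co_twin_affected if affected)
    return 100.0 * concordant / len(co_twin_affected)


# Hypothetical samples of ten pairs each (not data from any cited study).
hypothetical_identical = [True, True, True, False, True,
                          True, True, True, False, True]
hypothetical_fraternal = [False, False, True, False, False,
                          False, True, False, False, False]

identical_rate = concordance_rate(hypothetical_identical)
fraternal_rate = concordance_rate(hypothetical_fraternal)

print(f"identical twins: {identical_rate:.1f} per cent concordant")
print(f"fraternal twins: {fraternal_rate:.1f} per cent concordant")
```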

The twin-study method was employed in an extensive research project on 
schizophrenia conducted by Kallmann and his associates (53). Beginning with 
953 twin index cases (268 identical and 685 fraternal), all of whom were 
institutionalized in New York State and diagnosed as schizophrenic, the in- 
vestigators proceeded to determine which, if any, of the relatives had been 
similarly diagnosed. The principal results are summarized in Table 21. It will 
be seen that 86.2 per cent of the identical twin pairs, as contrasted to 14.5 
per cent of the fraternal twin pairs, were concordant. It is particularly
noteworthy that the concordance rate for fraternal twins is practically the same as 
that for siblings. The findings of this study have been roughly corroborated by 
less extensive surveys conducted in other countries (see 51, ch. 2; 53). 

TABLE 21 Concordance Rates for Schizophrenia as Determined 
for 953 Twin Index Cases in New York State 
(Data from Kallmann, 53, p. 146) 

Type of Kinship          Concordance Rate* 
Monozygous twins         86.2 
Dizygous twins           14.5 
Full siblings            14.2 
Half siblings             7.1 
Step siblings             1.8 
Parents                   9.3 

Expectancy in general population: 0.7-0.9 

* Corrected for age to allow for the fact that the expectancy rate for schizophrenia 
varies with age. 

Although such data strongly suggest the operation of hereditary factors in 
schizophrenia, the conclusiveness of the evidence has been questioned on 
several grounds (51, ch. 2). A number of studies have shown, for example, 
that identical twins receive more similar parental treatment and have closer 
contact with each other than do fraternal twins (see 2, pp. 287-288). Thus 
environmental differences are not ruled out when the two types of twins are 
compared. In addition, certain procedural biases may have operated in the 
samples investigated (51, ch. 2; 92). Because of their striking physical
resemblance, for example, identical twins are more likely to be noticed than 
fraternal twins when both members of a twin pair are hospitalized in the 
same institution. A larger proportion of concordant identicals than concordant 
fraternals might thus come to the attention of investigators. Since the diagnosis 
of schizophrenia is so loosely applied, moreover, the fact that a patient's 
identical co-twin is known to be schizophrenic may increase the probability 
that he is also so classified. 

Organic Factors. Any organic defects identified as causal factors in 
mental disorders may themselves have either a hereditary or an environmental 
origin. On the one hand, defective genes may produce metabolic disorders 
that ultimately lead to mental deficiency or other behavioral disturbances. On 
the other hand, psychological disorders may result from a number of condi- 
tions in the prenatal environment, such as toxins, bacterial infections, nutri- 
tional inadequacies, and X-ray irradiation (see 7, 33). Brain injuries occurring 
during the birth process, as well as neonatal anoxia, may also cause intellectual 
defects and other psychological abnormalities. Severe diseases like meningitis 
or encephalitis in early childhood may have similar results. 

The effects of many of these conditions have been investigated experi- 
mentally in animals as well as through follow-up studies of human infants. 
In a well-controlled animal experiment, for example, oxygen deprivation dur- 
ing gestation or immediately after birth significantly decreased the learning 
performance of rats and cats at maturity (69). In another study (38), 355 new- 
born infants were classified into those who had had normal birth, those who 
had experienced anoxia, and those who had undergone other birth complica- 
tions (prematurity, injuries, etc.). Three years later, the children were given 
neurological and psychological examinations. To control observer bias, the 
examiners had no knowledge of the subjects’ original classification. The anoxia 
cases performed significantly more poorly on the Stanford-Binet and other 
psychological tests and showed signs of neurological impairment. Similar but 
more varied results were obtained with other birth complications. This study 
supported the view that “minimal brain damage,” too slight to cause
conspicuous disorders like cerebral palsy, may nevertheless produce behavioral 
deficiencies. 

Following an extensive survey of many different complications of gestation 
and birth, Pasamanick and his associates (56, 60) proposed a “continuum of 
reproductive casualty.” By this they meant that varying degrees of prenatal or 
natal trauma will lead to a continuum of effects ranging from mild intellectual 
or emotional disorders to severe mental deficiency, cerebral palsy, and death. 
The data for this survey were obtained retrospectively, by examining the 
birth certificates and obstetrical records of children whose subsequent condi- 
tion was known. In the same survey (86), an analysis of results by socio- 
economic level showed a marked excess of prenatal and natal abnormalities in 
lower socioeconomic levels, probably resulting from dietary deficiencies and 
inferior maternal care during pregnancy. These findings suggest a possible 
basis for some of the socioeconomic differences that are consistently found 
both in intelligence test performance and in the incidence of emotional 
disorders. 

Organic factors have also been extensively investigated in connection with 
the etiology of schizophrenia (8; 35; 51, parts II and III; 55; 89; 94). Signifi- 
cant differences between schizophrenics and normals have been found in 
electroencephalograms, incidence of structural defects of cortical neurones, 
metabolic processes, blood chemistry, and endocrine functions. Several in- 
vestigators have measured the toxicity of the serum of schizophrenics, using 
various animal forms as test objects. Research with psychotomimetic drugs, 
which simulate psychotic symptoms in normal persons, suggests that schizo- 
phrenia may be characterized by a disordered metabolic response to stress, 
which in turn derives from a biochemical defect (94). This hypothesis could 
also account for the effect of certain tranquilizers in terms of their specific 
biochemical action. 

All this research activity is opening up promising new approaches to the 
etiology of schizophrenia and other psychological disorders. At this stage, how- 
ever, conclusions would be premature. In interpreting the findings of any 
study in this area, we must bear in mind, first, that schizophrenia is a broad 
and loosely applied diagnostic category, which actually covers a variety of be- 
havioral difficulties. Different forms of schizophrenia may have different 
etiologies. Even the same symptoms may result from different factors in differ- 
ent persons. It is likely, too, that organic and experiential factors interact in 
the development of schizophrenic disorders. Thus either hereditary or environ- 
mentally produced organic deficiencies may make the individual more vulner- 
able to psychological stress. In the absence of severely stressful experiences, 
however, behavioral disturbances may fail to develop even in one who is 
organically predisposed to them. 

When organic differences are found between schizophrenics and normals, 
we must also consider the possibility that the differences may be a result of 
emotional stress, degree of activity, nutritional state, and other variables asso- 
ciated with either the psychotic condition itself or with institutionalization 
(see 48; 55; 89, ch. 7). It was found, for example, that the serum of schizo- 
phrenics oxidized adrenaline more rapidly than did the serum of normal 
controls. Further investigation, however, demonstrated that this difference 
resulted from a vitamin C deficiency in the diet of the institutionalized 
schizophrenics (5). When this variable was controlled, the difference between 
schizophrenics and normals disappeared. Several other alleged biochemical 
differences between schizophrenics and normals likewise disappeared when 
checked in the well-controlled study at the National Institute of Mental 
Health (55), cited earlier in this chapter (p. 410). 

Experiential Factors. In line with the recognition that learning plays an 
important part in the development of adjustment mechanisms, problem-solving 
techniques, and other response variables, there has been an increasing amount 
of research on the relation between experiential factors and psychological 
disorders. Particular interest has centered on familial experiences during the 
formative years of childhood, although adult experiences in different cultures 
and subcultures have also received attention. Only a few examples will be 
cited to illustrate some of the approaches followed in this broad and active 
area of current research. 

One group of studies has investigated the mothers (and occasionally the 
fathers) of schizophrenics—their personality characteristics, their attitudes to- 
ward children, and their child-rearing practices. Although the studies differ in 
many ways and have yielded some conflicting results, certain suggestive trends 
have emerged (see 47; 51, chs. 10-14; 54). The term schizophrenogenic mother 
has been introduced to refer to the type of mother whose behavior tends to 
induce schizophrenia in her offspring. Such mothers have been described as 
emotionally immature, dominant, rigid, restrictive, and manifesting extremes 
of either rejection or overprotectiveness toward the child. They tend to be 
impervious to the child's feelings and to discourage the development of
autonomy. Another characteristic attributed to them is “intrusiveness,” or an
unwillingness to give the child any privacy, even in his thoughts. According to one 
hypothesis, the schizophrenogenic mother, because of her own emotional
conflicts, habitually gives the child conflicting and mutually incompatible
communications of feelings and ideas (40; 51, ch. 13). 

The foregoing data have been gathered by a variety of techniques, including 
interview, questionnaire, and psychological test studies of the mothers, as well 
as retrospective reports by patients. The latter procedure, of course, is subject 
to perceptual distortions by the emotionally disturbed informant. Some in- 
vestigators have utilized a short situational test to observe the interpersonal 
relations of parents and child (34). In a much more intensive investigation of 
the total family pattern, schizophrenic patients and their parents lived
together on a psychiatric ward in a research center for as long as 2½ years (51, 
ch. 12). In a few families, normal siblings also lived on the ward for as much 
as a year. All family members participated in joint psychotherapy sessions as 
a family unit. Data were gathered through these therapeutic sessions as well 
as through daily observations by nurses and ward attendants. In all these 
studies, it is difficult to disentangle cause-and-effect relations. We do not know 
to what extent the abnormal behavior of the patient may have led to the dis- 
turbed family relations. Only a longitudinal study could answer this question 
conclusively. 

Cultural differences in family patterns were highlighted in a study of the 
specific symptoms of schizophrenic patients differing in national origin (105). 
The subjects were two groups of male schizophrenics in a Veterans Administra- 
tion hospital. While closely equated in age, education, socioeconomic level, 
religion, and number of American generations, one group was of Irish and 
the other of Italian extraction. Both personality test performance and ratings of 
ward behavior revealed significant group differences in the nature of the 
schizophrenic symptoms. The Irish-Americans showed a greater tendency 
toward imaginative and fantasy behavior and more inhibition of motor ex- 
pression; the Italian-Americans were more overtly aggressive and impulsive. 
The investigators related these findings to subcultural differences in
child-rearing practices and family constellation, as well as to other cultural factors. 
A later study, dealing with female schizophrenics from the same two national 
groups, revealed very similar differences in symptoms (28). 

Another approach is concerned with epidemiology, or the distribution of 
mental disorders with reference to various population characteristics (45, 58, 
84, 85, 99, 108). Of particular interest is the greater incidence of mental 
disorder among lower than among higher socioeconomic levels. This differ- 
ence has been repeatedly demonstrated and persists when selective factors 
have been ruled out by a variety of procedures. Among the many possible 
conditions contributing to these differences are child-rearing practices, educa- 
tional level, amount of psychological stress encountered in occupational and 
other adult activities, and class differences in attitudes toward mental health 
and therapy. 

A comparison of the socioeconomic level of the patients themselves with 
that of their parents sheds some light upon the probable influence of specific 
etiological factors. Surveys conducted in America reveal that the fathers of 
schizophrenics tend to be in the same occupational level as the patients (45). 
Hence the conditions associated with low socioeconomic level have been 
operative since the patient's childhood. In British surveys, on the other hand, 
the occupational distribution of the fathers is fairly typical of that found in 
the general population, while the schizophrenic sons tend to cluster in the 
unskilled labor level (75). In neither country, however, can the low occupa- 
tional status of the patients be attributed to a downward drift on the part of 
the patients themselves, since the patients tended to remain in the same oc- 
cupational level throughout their own working life. These analyses illustrate 
the complexity of conditions to be considered in interpreting epidemiological 
findings. Nevertheless, the epidemiological approach holds much promise for 
both the understanding of mental disorders and the identification of preventive 
measures. 

Finally, mention should also be made of the extensive anthropological 
literature pertaining to mental abnormalities in different cultures (see 2, pp. 
617-619; 23; 30; 41; 51, ch. 10; 61; 117). Not only the total incidence of 
mental disorders, but also the relative frequency of such syndromes as schizo- 
phrenia and manic-depressive psychoses or of anxiety and conversion hysteria, 
vary widely among cultures. Moreover, some syndromes appear to be specific 
to certain cultures and absent in others. The social history of our own culture 
shows temporal changes and "fashions in abnormality," as illustrated by the 
dancing manias of the Middle Ages and by the conspicuous decrease in the 
frequency of hysterical anesthesias and motor disorders between the two world 
wars. The whole problem is further complicated by the fact that behavior that 
is deviant or disabling in one culture may be accepted and even favored in 
another (2, pp. 614-617). 


INTEGRATION OF CLINICAL AND 
EXPERIMENTAL APPROACHES 


With the growing interest in basic research on mental disorders, more and 
more efforts are being made to integrate clinical and experimental approaches 
in this area. As a result, clinical observations, laboratory studies of human 
subjects, and animal experiments tend to be combined in the formulation and 
testing of hypotheses. On the one hand, controlled experiments provide ob- 
jective tests of clinical “hunches” and working hypotheses; on the other, psy- 
chotherapy contributes to a fuller understanding of normal personality. Be- 
cause the non-threatening psychotherapeutic situation induces the client to 
communicate more freely and fully than he does in most other interpersonal 
contexts, it can provide a unique data-gathering opportunity. 

Although the research possibilities of the combined clinical and experimental 
approach are still largely unexplored, a number of psychologists representing 
different theoretical orientations have made notable contributions in this direc- 
tion. Following an early observation by Pavlov in the course of his conditioning 
experiments, several investigators have established “experimental neuroses” in 
dogs, cats, rats, sheep, and other animals by exposing them to conflict situa- 
tions (see 26, ch. 19; 102, pp. 112-118). The usual procedure is to condition 
approach and avoidance responses to the same or very similar stimuli. For 
example, cats conditioned to open a food box and eat when a buzzer or a light 
was turned on were later stimulated by an air blast as they approached the 
food box (66). The resulting indications of anxiety and disorganization of 
behavior were more intense and lasting than those following simple stimula- 
tion by an air blast. Through several subsequent studies, it was demonstrated 
that it was the conflictual nature of the situation that produced the behavioral 
symptoms observed in these experiments. 

To facilitate the formulation of testable hypotheses, a number of
psychologists have applied concepts derived from general psychology to the description 
of neurotic and psychotic behavior. Some have used concepts from information 
theory (70), others from sensory deprivation (51, pp. 269-272; 93), and still 
others from developmental psychology (36). Because of the key position it 
occupies in all psychology, learning theory has been employed most widely 
in proposed explanations of behavioral deviations. These formulations have 
been prepared within the framework of different learning theories (see, e.g., 
19, 64, 76, 77, 118). 

One of the most thoroughgoing formulations, which has stimulated
extensive research, is that given by Dollard and Miller in their book on
Personality and Psychotherapy (19) and extended in later publications by Miller and 
his co-workers (see 72). These investigators first reformulated common
psychoanalytic concepts such as repression, displacement, and transference in terms 
of the conditioned-reaction learning theory developed by Hull (49). Together 
with a group of their associates and students at Yale University, they then 
designed and conducted controlled animal experiments to test the various 
hypotheses generated by these theoretical formulations. Finally, they checked 
clinical observations of the behavior of patients in psychotherapy against their 
experimental findings. Working within the same theoretical framework, 
Mednick (67) has more recently proposed an interpretation of both acute and 
chronic schizophrenia in terms of anxiety level and stimulus generalization.³ 

³ See also the critique of this formulation and the rejoinder in Psychol. Bull., 1959, 
56, 313-316. 

Of the many investigations conducted by Miller and his associates, one will 
be described to illustrate the general approach. This study deals with the 
mechanism of displacement, which has been used widely in clinical
psychology to explain symptom formation. Briefly, displacement refers to the
substitution of another stimulus or response for the one that arouses excessive anxiety. 
Like other symptom mechanisms, displacement has been interpreted as a 
defense against intolerable anxiety. Miller and his associates formulated an 
explanation of displacement in terms of stimulus generalization and conflict 
(71-74, 81, 82). Through a series of ingenious experiments with rats, they 
demonstrated that, as stimulus similarity decreases, avoidance responses decline 
more rapidly than approach responses. In technical conditioning terminology, 
the generalization gradient is steeper for avoidance than for approach. By 
similar procedures, it was shown that avoidance falls off more rapidly than 
approach along a distance gradient. Thus, in a conflict situation, in which 
food and an electric shock are associated with the same goal object, the rat will 
approach part way and then stop. 
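
The reasoning about the two gradients can be made concrete with a small numerical sketch. The Python example below is illustrative only. It assumes, for simplicity, that both gradients are linear in distance from the goal and uses arbitrary strength units. Neither the linear form nor the numerical values come from the experiments described here. Under these assumptions, the animal is expected to stop at the distance where the strengths of approach and avoidance are equal.

```python
def stopping_distance(approach_at_goal, approach_slope,
                      avoidance_at_goal, avoidance_slope):
    """Distance from the goal at which approach and avoidance strengths are equal.

    Each tendency is modeled as: strength = value_at_goal - slope * distance.
    The linear form is an assumption made only for illustration.
    """
    if avoidance_slope <= approach_slope:
        raise ValueError(
            "the avoidance gradient must be steeper than the approach gradient")
    if avoidance_at_goal <= approach_at_goal:
        # Approach is at least as strong as avoidance even at the goal itself,
        # so the animal is expected to reach the goal.
        return 0.0
    return ((avoidance_at_goal - approach_at_goal)
            / (avoidance_slope - approach_slope))


# Hypothetical values in arbitrary units.
stop = stopping_distance(approach_at_goal=6.0, approach_slope=0.5,
                         avoidance_at_goal=10.0, avoidance_slope=1.5)
print(f"expected stopping point: {stop:.1f} units from the goal")
```

Farther from the goal than this point, approach is the stronger tendency and the animal moves forward. Nearer than this point, avoidance is stronger and the animal retreats or halts.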

Combining these displacement and conflict hypotheses, Murray and Berkun 
(81) conducted an experiment with the apparatus illustrated in Figure 105. 
The rats were first given approach training in the wide white alley, in which 
they could obtain food by raising the lid of the food box. In the second series 
of trials, they received an electric shock upon contact with the food box but 
still found food within it. This represents a typical conflict situation. In the 
third part of the experiment, the walls with windows, shown in Figure 105, 
were inserted and the rats were again placed in the wide white alley. No 
shock was used, and food was available in the white alley only. It was antici- 
pated that the conflict of approach and avoidance would lead to an initial 
approach response for a short way along the white alley. According to the 
displacement hypothesis, the animal should then move to the medium-width 
gray alley. After proceeding some distance along this alley, the increasing 
strength of the avoidance response should again halt progress, and the animal 
should displace to the still less similar narrow black alley. The more different 
the new alley, the closer the rat should approach the food end.* Following 
repeated training with food in the white alley and no shock, the substitute 
responses should gradually be eliminated, and the original direct response in 
the white alley should reappear. 

The empirical findings of the rat experiments confirmed these hypotheses. 
This experiment in turn provided a theoretical model for explaining behavior 
during psychotherapy. It was applied, for example, in predicting the course of 
the client’s verbal expressions of hostility toward various persons in his life as 
psychotherapy progressed. Neither the displacement model nor the conflict 
model alone fitted the observed results. It was only when the two were com- 
bined that the sequence of events in psychotherapy confirmed expectation. 

It is beyond the scope of this book to consider the detailed formulations of 
these hypotheses and the nature of the evidence gathered to test them. The 
interested reader is urged to examine the original publications, especially the 
article by Murray and Berkun (81). For the present purpose this research was 
cited only to illustrate the possibility of combining theoretical formulations, 
clinical observations, and controlled laboratory experiments in an integrated 
research program. 

*In the apparatus used, the most dissimilar alley is also farthest from the original 
alley. For the present purpose, however, it does not matter whether the gradient is 
one of dissimilarity, distance, or a combination of the two. 

Fig. 105 Apparatus used in studying combined effects of stimulus 
generalization and avoidance-approach conflict in rats. (From Murray 
and Berkun, 81, p. 50.) 


SUMMARY 


The gap between clinical practice and research has been narrowing since 
World War II. A major area of applied research in clinical psychology is con- 
cerned with the validity of diagnostic procedures, particularly projective tech- 
niques and other special clinical instruments. In the validation of such in- 
struments, special attention must be given to questions of blind analysis, stereo- 
type accuracy, multitrait-multimethod validation, cross validation, translation of 
test protocols into testable behavioral predictions, adequacy of criterion meas- 
ures, interaction effects, and unevenness of coverage of the same test for 
different persons. 

Investigations of the psychotherapeutic process deal with changes in client 
behavior in the course of therapy, the specific ways in which different therapists 
function, and the effect of therapist characteristics upon the progress of
psychotherapy. Process research has been greatly furthered by the recording of 
psychotherapeutic sessions and by content analysis of the verbal responses of 
client and therapist. Q sorts have been used by the client-centered school to 
measure changes in self concepts in the course of psychotherapy. Appreciable 
interactions have been found among client, therapist, and process variables. 
The effects of subtle verbal reinforcement by the therapist upon client responses 
have been repeatedly demonstrated. Besides verbal responses, physiological 
indices of anxiety have also been investigated in the course of psychotherapy. 

It is particularly difficult to obtain conclusive measures of the outcomes of 
therapy. Among the criteria of outcome employed for this purpose are client 
self-report, therapist's evaluation, objective measures of behavioral changes 
during therapy sessions, ward behavior, and follow-up data in real-life situa- 
tions. Representativeness and comparability of experimental and control groups 
are essential but difficult to achieve. There is also need for control procedures 
to eliminate the influence of suggestion. 

Research on the causes of mental disorders is an interdisciplinary venture 
spanning many fields, from biochemistry to anthropology. The sources of in- 
tellectual and emotional disorders have been sought in heredity, organic con- 
ditions, and experiential factors. Basic research that integrates clinical and 
experimental approaches is beginning to make vital contributions both to 
clinical practice and to the science of psychology. The reformulation of clinical 
concepts in terms of learning theory has been especially productive of research 
that cuts across clinic and laboratory. 


Counseling Psychology 


The Work of the 
Counseling Psychologist 


"Counseling" is a broad term that covers many different functions. A pro- 
fessional counselor, moreover, may have been trained in psychology, social 
work, education, the ministry, or a number of other fields. It is only since the 
1950s that the title of “counseling psychologist” was adopted to designate a 
fully trained psychologist, typically at the Ph.D. level, who specializes in 
counseling functions. The term “counselor” is commonly employed to refer to 
persons operating at a lower professional level and with less training. 

Historically, counseling psychology originated in the vocational-guidance 
movement (51). The early vocational counselors concentrated on the dis- 
semination of information about jobs. Following the rapid growth of psycho- 
logical testing after World War I, tests were widely employed to investigate 
the individual's qualifications for different kinds of work. Still later came the 
recognition that vocational choices and vocational adjustment are intimately 
related to personality development and to total life adjustment. As a result, 
the functions of the vocational counselor were broadened to include a study 
of the whole individual. Concurrently, the rise of the mental health move- 
ment, the introduction of employee counselors in industry, and other develop- 
ments highlighted the need for counselors to deal with a number of personal 
problems other than those associated with vocational choice. At this stage, 
counseling merged with clinical psychology. 

Some psychologists today use the terms "clinical psychology" and "counsel- 
ing psychology" interchangeably and recognize no difference between the two 
fields (see, e.g., 38, 66, 67). To be sure, there is much overlapping in the 
activities of most clinical and counseling psychologists. Among individual 
practitioners, moreover, some counseling psychologists can readily be found 
who in their functions resemble a particular clinical psychologist more closely 
than two clinical psychologists resemble each other. Nevertheless, the two 
fields as a whole can be differentiated, as will be shown in a later section of 
this chapter. This differentiation has received formal recognition by the psycho- 
logical profession (1). Thus the American Board of Examiners in Professional 
Psychology issues separate diplomas in clinical and in counseling psychology. 
Similarly, the Education and Training Board of the American Psychological 
Association accredits university training programs independently in these two 
specialties. The American Psychological Association has also had separate 
divisions in the clinical and counseling fields since 1945, when a divisional 
structure was first introduced into the association. On the other hand, the over- 
lap between clinical and counseling psychology is suggested by the close 
similarity in actual course requirements in university training programs in 
these two fields (64). 
VARIETIES OF COUNSELING 

Counseling psychologists work in many contexts. Schools and colleges 
represent one of the principal settings for their activities. An increasing 
number are being employed in industrial and business organizations, govern- 
ment agencies, and the armed services. Others work in hospitals, rehabilita- 
tion centers, vocational-guidance bureaus, marital clinics, and other community 
agencies. To a large extent, the setting in which a counseling psychologist 
operates determines the type of problems with which he must deal. 
Vocational counseling still represents a major part of the work of most 
counseling psychologists, particularly those employed in student counseling 
bureaus of high schools and colleges (15). Because of its central position in 
the entire counseling field, vocational counseling will be discussed more fully 
in later sections of this chapter as well as in the next chapter. A closely related 
area is that of educational counseling. Frequently the selection of a program 
of studies is implied in a vocational decision. In its own right, however, 
educational counseling is concerned with such decisions as whether or not 
to attend college, the choice of a particular college, what curriculum to pursue, 
or what specific courses to elect. Educational counselors may also deal with 
problems of study skills, reading disabilities, and other academic difficulties. 
Educational counseling is often conducted on a part-time basis by classroom 
teachers. To a more limited extent, such part-time counselors may also provide 
vocational counseling within academic settings, from the elementary school 
through the college level. In such situations, a counseling psychologist or 
other guidance specialist may direct the counseling program or serve as a 
consultant in its administration. Counseling psychologists also conduct in- 
service training and special training institutes for educational and vocational 
counselors and participate in the general training programs conducted in 
schools of education. More will be said about these functions in connection 
with the relations of psychology to education, to be considered in Chapter 18. 
Guidance functions are now regarded as an integral part of the daily class- 
room activity of elementary and high school teachers. Accordingly, guidance 
techniques are regularly included in teacher training programs, and several 
books on guidance have been directed specifically to the needs of teachers 
(see, e.g., 46, 59). Group counseling techniques suitable for use in a school 
situation have also been developed (44, 47, 91). There is evidence that such 
group procedures may be as effective as individual counseling under certain 
circumstances (83, ch. 13). At the college level, counseling functions are 
generally integrated in the student personnel program, which covers also 
housing, student activities, student government, health services, and other 
extracurricular matters (60). To aid in the vocational counseling of college 
students, a number of books have been written for use by the students them- 
selves (e.g., 37, 58). 
Employee counseling in industry was ushered in by the Hawthorne Studies, 
discussed in Chapter 6. The growing recognition of the importance of employee 
morale has led to the appointment of employee counselors in many industrial, 
business, and governmental organizations. The activities of such counselors are 
usually quite varied, ranging all the way from assistance in locating suitable 
housing to counseling on emotional problems. The problem need not be 
directly related to the job. For some maladjusted employees, the source of the 
difficulty may be at home or in some other problem arising off the job. 
Depending upon the employee counselor's professional qualifications and 
the time available per client, he may handle problems himself or make ap- 
propriate referrals. In any event, an employee counselor would not himself 
undertake long-term psychotherapy with seriously disturbed individuals. In 
order to establish an effective counseling relationship, the employee counselor 
should have no administrative duties or authority. Only under these conditions 
would the employee feel completely free to discuss his problems with the 
counselor. In some companies, employee counseling is carried on by independ- 
ent consulting psychologists on a part-time basis. This practice is followed 
especially in the counseling of executive personnel. 

Rehabilitation counseling is a relatively recent but rapidly expanding field 
of activity for the counseling psychologist (38, ch. 12; 40; 55; 88). The 
rehabilitation counselor works with the orthopedically handicapped, the 
tubercular, the cardiac, the blind, the deaf, and other types of disabled 
persons. One of the recently formed divisions of the American Psychological 
Association is concerned with psychological aspects of disability (see Appendix 
A), and in 1958 the association sponsored a special institute on psychology 
and rehabilitation (94). There is also a growing body of psychological re- 
search dealing with the characteristics and problems of the physically disabled 
(6, 22, 95). 

Counseling of the physically handicapped must be closely coordinated 
with remedial and training programs. Such counseling may have several ob- 
jectives. In the case of children, the physical disability is likely to have 
delayed psychological development through the curtailment of normal experi- 
ences. Such children may require not only special educational programs but 
also interpersonal experiences for normal personality growth. Handicapped 
persons of any age tend to have special emotional problems, such as feelings 
of insecurity or hostility and various types of defensive reactions. These re- 
actions often stem from a feeling of being different from one's associates. An 
important goal of rehabilitation counseling is the development of a realistic 
and constructive attitude toward one’s handicap. Emotional acceptance of 
prosthetic and other adaptive devices is a closely related problem. 

The successful adjustment of the disabled person often requires counseling 
with family, teachers, employers, and other associates who interact with the 
individual in significant ways. Vocational counseling is also an important 
aspect of the rehabilitation program. The physical disability limits the voca- 
tional opportunities available to a young person entering the world of work 
and often makes a return to the former job impossible for a previously em- 
ployed worker. Satisfactory work adjustment can be achieved by a combination 
of intensive testing with realistic work samples, special training, experience 
in sheltered workshops, and eventual job placement. Such a program needs 
to be closely integrated with a medical evaluation of the limits set by the 
physical disability (see, e.g., 34). 

Patients who have been institutionalized for psychiatric disorders 
may also require rehabilitation before resuming their normal activities. Today 
the scope of rehabilitation counseling has been broadened to cover mental as 
well as physical disabilities (94). A noteworthy development in this con- 
nection is the gradual inclusion of psychiatric patients in rehabilitation 
centers originally established to care for the physically disabled. The Institute 
for the Crippled and Disabled in New York City has successfully initiated 
such an extension of its services. Similar institutes throughout the country 
have begun to accept a few psychiatric patients, although no formal recognition 
of this function is included in their objectives. Since the physically disabled 
often have psychiatric problems, institutes designed to serve such patients 
may already have the facilities and experience for handling psychiatric dis- 
abilities. 

Old-age counseling is concerned with the vocational and personal guidance 
of older persons, with retirement problems, and with the special intellectual 
and emotional disorders associated with old age. Practical concern with the 
problems of advanced age has been stimulated by the relatively large number 
of older persons in our society. As medical progress and better living con- 
ditions lengthen the life span, the proportion of older persons in the population 
continues to increase. Gerontology, or the study of old age, and geriatrics, 
dealing with diseases of old age, are now thriving fields of research and pro- 
fessional specialization. 

The active interest of psychologists in these fields is evidenced by the 
establishment of a Division on Maturity and Old Age in the American 
Psychological Association and by an increasing amount of research on the 
changes occurring beyond maturity in intelligence, special aptitudes, emotional 
reactions, interests, and attitudes (see 2, pp. 238-258; 10; 11; 71). The use 
of longitudinal approaches, large representative samples, and improved experi- 
mental designs is beginning to clarify the nature and causes of these changes. 
Special attention has also been given to the industrial skills of older workers 
and to ways of maximizing their effective utilization (e.g., 92). 

The adjustment problems of older persons provide a fertile field for the 
application of psychotherapy, but one that has not yet been adequately ex- 
plored (65). Among very old persons, intellectual and emotional deteriora- 
tion is likely to have some organic basis. But, even in these cases, experiential 
factors play a significant part, and psychotherapeutic techniques are likely 
to bring about some improvement. As in the case of rehabilitation counseling 
with the handicapped, the psychological counseling of the aged often needs 
to be coordinated with medical care and with social work. 


marriage counseling, this is so far one of the least-developed fields of counsel- 
ing psychology (49, 72). 
It is apparent that counseling with regard to emotional problems, attitudes, 
and interpersonal relations runs through all forms of counseling. In addition, 
some individuals seek the help of a counselor explicitly because of an emo- 
tional problem. A recent college survey indicated the prevalence of emotional 
problems among college students and the need for college counselors equipped 
to handle them (12). But even when the referral problem pertains to a 
vocational decision, an educational question, or some other specific and rela- 
tively "impersonal" matter, the root of the difficulty often lies in a personality 
problem. Moreover, personality difficulties frequently complicate and aggravate 
other problems. 

Counseling with regard to emotional problems is usually designated as 
personal counseling, although it is also known as adjustment counseling, 
psychological counseling, or short-term psychotherapy. It is in this area that 
the psychologist can make his most distinctive contribution. Furthermore, 
the counseling psychologist recognizes the close interrelation of all adjustment 
problems and therefore functions at a broad level regardless of the type of 
counseling involved. His orientation tends always to be toward counseling the 
whole individual. It is in connection with personal counseling, too, that the 
work of the counseling psychologist comes closest to that of the clinical 
psychologist. Hence at this point it is appropriate to examine more fully 
the relation between these two fields. 


RELATION BETWEEN CLINICAL AND 
COUNSELING PSYCHOLOGY 


There are unquestionably many similarities between the functions of a 
clinical and a counseling psychologist. For both, the interpersonal relation 
between psychologist and client is of prime importance. As in clinical practice, 
the counseling relationship is nonthreatening, confidential, permissive, and 
accepting (83, pp. 14-17). Similarly, counseling depends upon verbal com- 
munication. Most counseling psychologists employ some variant of the client- 
centered, or nondirective, approach discussed in Chapter 14, although many 
combine this approach with considerable information giving of a fairly direc- 
tive type. Classical psychoanalysis is least likely to be used in a counseling 
situation. 

What of the differences? Perhaps the primary difference is that the aim 
of clinical psychology is to change the individual’s basic personality structure 
and personal constructs, while the aim of counseling psychology is to enable 
the individual to utilize his present resources most effectively in solving prob- 
lems. Tyler (82; 83, ch. 11) has introduced the concept of "minimum-change 
therapy" to characterize the counseling process. Its object is to bring about 
readjustment with a minimum of personality change rather than to delve into 
the sources of anxieties and conflicts. 

While psychotherapy tries to alter a client's anxiety level, defensiveness, 
and other generalized response habits, counseling concentrates on 
specific problems, such as poor study habits or lack of social poise, which 
may be interfering with effective functioning (8). It is also concerned with 
the making of decisions and plans regarding educational, vocational, and 
other courses of action. The counseling psychologist, however, does not as a 
rule give advice or tell the client what he should do. It is now generally felt 
that to do so would reinforce habits of dependency rather than facilitate 
personal growth. 

Because it does not undertake to restructure a client's personality or to 
provide insight into all his emotional conflicts, counseling characteristically 
requires less time than does psychotherapy. It will be recalled that duration 
of treatment also differentiates nondirective therapy from psychoanalysis 
(Chapter 14). In many ways, the differences between counseling and clinical 
psychology are similar to those between the nondirective and the psycho- 
analytic approaches. Among client-centered, or nondirective, practitioners, the 
distinction between counseling and psychotherapy is most likely to be 
blurred. 

Another difference between clinical and counseling psychology is that the 
former focuses on weaknesses to be overcome, while the latter focuses on 
positive strengths to be developed (82; 83, ch. 11). Typically the counselor 
is not concerned with ferreting out personality deficiencies that are adequately 
controlled. Rather he concentrates on the utilization of any available assets 
that he can identify in the particular client. Slight shifts in the direction in 
which an individual is moving may enable him to capitalize on these assets 
and to bypass obstacles. The counseling psychologist looks for the normalities 
found even in abnormal persons, whereas the clinical psychologist and, to a 
greater extent, the psychiatrist look for the abnormalities found even in 
normal persons. In its emphasis upon positive development and the preven- 
tion of maladjustment, counseling is consistent with the aims of the mental 
health movement (Chapter 14). Perhaps, as we learn more about the causes 
and prevention of mental disorders, there will be an increasing demand 
for counseling psychologists and a decreasing need for clinical psychologists. 
Unlike psychotherapy, counseling frequently requires the giving of factual 
information (39; 83, ch. 9). Thus, in educational and vocational counseling, 
the individual needs information about job requirements and opportunities, 
suitable schools and courses of study, effective study skills, or test norms in 
terms of which he may evaluate his own performance. Such information 
should be provided when it will help the client to think realistically and con- 
structively about his problems. The counselor may impart the information 
himself during an interview, or he may refer the client to appropriate persons 
or to readily available published sources. In any event, the information should 
be presented in such a way as to be understood and accepted. Effective com- 
munication, including sufficient feedback from the client, is essential to this 
aspect of counseling. 

The counselor himself also needs to be familiar with up-to-date and de- 
pendable sources of information in his particular area of operation. Since 
vocational counseling plays such an important part in the activities of most 
counselors, a knowledge of the world of work is a common requirement in 
the training of counseling psychologists. In fact, the inclusion or noninclu- 
sion of courses on occupational information often represents the principal 
difference between the basic training of counselors and clinical psycholo- 
gists (64). 

The type of client with whom clinical and counseling psychologists work 
has often been cited as another difference between the two fields. It is cer- 
tainly true that in general the counseling psychologist works with more nearly 
normal persons who may want help in making a specific decision or who 
may have minor personality difficulties. Hahn (39) sums it up by saying 
that the counseling psychologist is most often concerned with persons whose 
anxiety level is interfering and disruptive rather than disabling or disintegra- 
tive. To be sure, this distinction is only relative. On the one hand, some clinical 
psychologists will accept no severely disturbed cases for psychotherapy; they 
exclude all psychotics, as well as neurotics with symptoms of long standing. 
On the other hand, some counseling psychologists work with hospitalized 
psychotic patients. In the latter cases, however, the counseling psychologist 
is concerned with the rehabilitation and vocational adjustment of the patient 
rather than with treatment of his psychotic condition. Frequently such counsel- 
ing is introduced shortly before discharge, to enable the individual to make 
a more effective job and community adjustment. An example is the extensive 
program in counseling psychology conducted in Veterans Administration 
hospitals. 

The difference between clinical and counseling psychology has also been 
described in terms of the characteristic orientations and attitudes of the psy- 
chologists who specialize in the two areas. It has been suggested, for example, 
that counseling psychologists resemble industrial psychologists more closely 
than they do clinical psychologists (39). In contrast to the clinical psycholo- 
gist, the counseling psychologist is more likely to rely upon objectively con- 
structed tests and empirical norms than upon subjective qualitative approaches. 
He relies less on clinical judgment or projective tests and more on structured 
ability and interest tests. Counseling psychologists often have to work closely 
with members of other professions, as was apparent in the preceding section. 
They must therefore be able to function effectively in various settings and 
to cooperate with persons whose training and background are very unlike 
their own. In several of these contexts, counseling psychologists also tend to 
have administrative and managerial responsibilities. 

Still another point of view about the relation of clinical and counseling 
psychology has been expressed by Vance and Volsky (89). While maintain- 
ing that clinical and counseling processes can be differentiated, they argue 
that these two kinds of treatment are often required by the same client and 
hence should not be artificially separated in practice. On these grounds they 
advocate combined training in clinical and counseling processes for each 
practitioner. 


USE OF TESTS IN COUNSELING 


Types of Tests Used in Counseling. Tests may be helpful in any kind 
of counseling but are most often employed to facilitate educational and 
vocational decisions. All types of tests may be used in counseling and prob- 
ably are, but certain types clearly predominate. For a detailed treatment of 
tests that are of special interest to vocational counselors, the reader is referred 
to Super and Crites (78). General surveys of available tests can be found in 
current texts, such as Anastasi (3) and Cronbach (21). The preliminary dis- 
cussion of the use of tests in personnel selection given in Chapter 3 is also 
relevant. 

In counseling, intelligence tests are commonly employed to determine the 
individual's general ability level, with special reference to his scholastic apti- 
tude. Persons engaged in different occupations differ significantly in mean 
intelligence test scores. The clearest demonstration of these differences was 
provided by the analyses of intelligence test scores of large samples of men 
in the United States Army during both world wars (32, 74). Figure 106 
illustrates the results with a few occupations selected from different portions 
of the total range. There are probably many reasons for these occupational 
differences, but a major factor is undoubtedly the educational level required 
for different types of work. The professions and other occupations calling 


Fig. 106 Scores on the Army General Classification Test in relation to 
civilian occupation. (Data from Stewart, 74, pp. 5-13.) 
for a high level of education typically show a higher average and a narrower 
range of intelligence test scores than do routine clerical and manual occupa- 
tions. 

Since most intelligence tests are primarily measures of scholastic aptitude, 
one of their applications to counseling is in predicting educational achievement 
and in estimating the highest educational level the individual is likely to 
reach. Intelligence tests are especially useful at the elementary school ages, 
when differentiation into separate abilities has not yet advanced very far (see 
2, pp. 357-361). Typically, counselors rely on group tests of intelligence, which 
may of course be administered either individually or in groups, and which 
are available for different levels ranging from the primary grades to the gradu- 
ate school. 

For educational counseling, intelligence tests are often supplemented with 
achievement tests in different areas. Of particular interest are tests of reading 
and arithmetic skills. Special disabilities in these skills—particularly in read- 
ing—may seriously interfere with over-all educational achievement. Special 
aptitude tests are frequently administered in individual cases as demanded 
by the particular circumstances. When considering a special educational pro- 
gram or making vocational plans, the individual may need information regard- 
ing his manual dexterity or other motor skills or about his mechanical, 
clerical, artistic, or musical aptitudes. Tests are available to aid evaluation in 
all these areas. 

Multiple aptitude batteries are of particular relevance to counseling prob- 
lems because they provide a profile of scores in relatively independent abilities. 
Combining the principal abilities covered by intelligence tests with other 
broad aptitude areas, they enable the individual to explore his major strengths 
and weaknesses. While a global intelligence test score may indicate general 
level of expected educational or vocational attainment, a profile of scores on 
a multiple aptitude battery is more helpful in the choice of a field of specializa- 
tion. 
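
The contrast between a global score and a profile can be made concrete with a small sketch. The two students and all percentile values below are hypothetical, invented purely for illustration, and the plain average is only a crude stand-in for a global score; it is not how any published battery is scored. The area names are the eight tests shown in Figure 107.

```python
from statistics import mean

def summarize(profile):
    """Return (average, strongest area, weakest area) for a score profile."""
    overall = mean(profile.values())
    strongest = max(profile, key=profile.get)
    weakest = min(profile, key=profile.get)
    return overall, strongest, weakest

# Hypothetical percentile scores, not real test data.
student_a = {
    "Verbal Reasoning": 80, "Numerical Ability": 60,
    "Abstract Reasoning": 55, "Space Relations": 30,
    "Mechanical Reasoning": 20, "Clerical Speed and Accuracy": 50,
    "Spelling": 55, "Grammar": 50,
}
student_b = {
    "Verbal Reasoning": 30, "Numerical Ability": 45,
    "Abstract Reasoning": 55, "Space Relations": 75,
    "Mechanical Reasoning": 85, "Clerical Speed and Accuracy": 40,
    "Spelling": 35, "Grammar": 35,
}

summary_a = summarize(student_a)
summary_b = summarize(student_b)
print(summary_a)
print(summary_b)
```

Both hypothetical students have the same average, yet their strongest and weakest areas are reversed, which is the kind of information a single global score cannot convey.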

A multiple aptitude battery that has been in use long enough to permit 
the accumulation of extensive validity data is the Differential Aptitude Tests 
(7). Sample items from each of the eight tests comprising this battery can be 
found in Figure 107. A score profile on the Differential Aptitude Tests was 
reproduced in Figure 15 (Chapter 3). Designed for grades 8 through 12 and 
for unselected adults, this battery was prepared principally for use in educa- 
tional and vocational counseling. The General Aptitude Test Battery de- 
veloped by the United States Employment Service (25) is employed widely 
in vocational counseling. Other generally available aptitude batteries have 
not yet been sufficiently validated to be considered ready for use in indi- 
vidual counseling (see 3, ch. 13; 77). 

Interest tests are undoubtedly one of the most popular counseling instru- 
ments. Essentially these tests are self-report inventories in which the individual 
records his likes and dislikes or his relative preferences for many kinds of 
familiar activities, people, or objects. In the Kuder Preference Record (52), 
the individual always compares three items, among which he marks the one 
he likes best and the one he likes least. For example, would he rather visit an 
art gallery, browse in a library, or visit a museum; would he rather collect 
autographs, coins, or butterflies? The responses are scored with reference to 
10 broad interest areas: Outdoor (agricultural, naturalistic), Mechanical, Com- 
VERBAL REASONING. 

Choose the correct pair of words to fill the blanks. The first word of 
the pair goes in the blank space at the beginning of the sentence; the 
second word of the pair goes in the blank at the end of the sentence. 

...... is to water as eat is to ...... 

A. continue - drive 
B. foot - enemy 
C. drink - food 
D. girl - industry 
E. drink - enemy 

The correct answer is C. 


NUMERICAL ABILITY. 

Choose the correct answer for each problem. 

Add 13 and 12. Subtract 20 from 30. 
[Answer options only partly legible; they include B 25 for the first 
problem and E none of these for both problems.] 

The correct answer for the first problem is B; for the second, E. 


ABSTRACT REASONING. 

The four “problem figures” in each row make a series. Find the one 
among the “answer figures” which would be the next in the series. 

[Problem and answer figures not reproduced.] 

The correct answer is B. 


SPACE RELATIONS. 

Which one of the following figures could be made by folding the given 
pattern? The pattern always shows the outside of the figure. Note the 
three grey surfaces. 

[Pattern and answer figures not reproduced.] 

The correct answer is D. 


Fig. 107a Sample items from the Differential Aptitude Tests. (Reproduced 
by permission. Copyright 1963, The Psychological Corporation.) 


MECHANICAL REASONING. 

Which man in this picture has the heavier load? 

[Picture not reproduced.] 

The correct answer is B. 


CLERICAL SPEED AND ACCURACY. 

In each test item, one of the five combinations is underlined. Find the 
same combination on the answer sheet and mark it. 

[Test items and sample of answer sheet not reproduced.] 


LANGUAGE USAGE: I. SPELLING. 

Indicate whether each word is spelled right or wrong. 

W. man 
X. gurl 


LANGUAGE USAGE: II. GRAMMAR. 

Decide which of the lettered parts of the sentence has an error and 
mark the corresponding letter on the answer sheet. If there is no 
error, mark E. 

Ain't we / going to / the office / next week? 
    A         B           C            D 


Fig. 107b Sample items from the Differential Aptitude Tests. (Reproduced 
by permission. Copyright 1963, The Psychological Corporation.) 
putational, Scientific, Persuasive, Artistic, Literary, Musical, Social Service, 
and Clerical. In addition, there is a verification scale for detecting carelessness 
or failure to follow directions. 

It should be noted that Kuder scores indicate relative interest in these 10 
areas, rather than absolute amount of interest. A flat profile might thus 
indicate that the individual is equally interested or equally uninterested in 
all of them. The test is constructed on the same principle as a Q sort, in 
which comparisons are made within the individual. A sample profile of 
scores on the Kuder is reproduced in Figure 108. This subject has his strong- 
est interests in the literary field, with social service and outdoor interests fol- 
lowing very closely. Other interest areas are low, the mechanical being un- 
usually weak. 
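
Why forced-choice scores are relative rather than absolute can be illustrated with a minimal sketch. The scoring rule used here (2 points for the activity liked best, 1 for the unmarked activity, 0 for the one liked least) and the assignment of each activity to a single interest area are assumptions made for illustration only; they are not the published Kuder scoring keys. The activities are the two sample triads quoted above.

```python
# Hypothetical mapping of each activity to one interest area.
AREA_OF = {
    "visit an art gallery": "Artistic",
    "browse in a library": "Literary",
    "visit a museum": "Scientific",
    "collect autographs": "Literary",
    "collect coins": "Computational",
    "collect butterflies": "Outdoor",
}

TRIADS = [
    ("visit an art gallery", "browse in a library", "visit a museum"),
    ("collect autographs", "collect coins", "collect butterflies"),
]

def score_triads(choices):
    """choices holds one (liked_best, liked_least) pair per triad."""
    totals = {}
    for triad, (best, least) in zip(TRIADS, choices):
        for activity in triad:
            # Assumed rule: best = 2, unmarked = 1, least = 0.
            points = 2 if activity == best else 0 if activity == least else 1
            area = AREA_OF[activity]
            totals[area] = totals.get(area, 0) + points
    return totals

first = score_triads([
    ("browse in a library", "visit a museum"),
    ("collect autographs", "collect coins"),
])
second = score_triads([
    ("visit a museum", "visit an art gallery"),
    ("collect coins", "collect butterflies"),
])
print(first, sum(first.values()))
print(second, sum(second.values()))
```

Under this assumed rule every respondent distributes the same total number of points, however strong or weak his interests may be in absolute terms. The scores can therefore rank the areas within one person but cannot show whether he is highly interested or uninterested in all of them.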

The Vocational Interest Blank developed by Strong (75) yields separate 
scores in each of about fifty occupations in the men’s form and about thirty 
in the women’s form. The construction of these keys was described in Chapter 
3. Essentially, a high score in any given occupational key means that the 
individual’s interests resemble those of persons engaged in that occupation. 
It suggests that he would find that occupation and his colleagues in it con- 
genial and that he would fit in well with their way of life. For guidance 
purposes, the total pattern of occupational scores is customarily taken into 
account. It may be just as important to know which occupational groups the 
individual is most unlike as it is to know which groups he resembles most 
closely. 

Figure 109 shows the standard profile chart on which Vocational Interest 
Blank scores are reported. The shaded areas represent the expected range of 
“chance scores," indicating neither agreement nor disagreement with the 
interests of that group. For illustrative purposes, the mean score of a group 
of 47 medical students on each key available at the time of testing has been 
marked with an x. As might be anticipated, this group obtained its highest 
mean score on the physician key. Other high scores were found on the dentist 
and chemist keys. 


With regard to the use of the Vocational Interest Blank in counseling, it 
should be noted that most [...] lower occupational levels may [...] that 
at lower levels job satisfaction is more often 
associated with factors extrinsic to the work itself, while at higher job levels 
it is more often associated with the nature of the work (see Chapter 6). 

[...] to a much more limited extent in [...] 
counselee to understand his own behavior. 


Despite its original pathological orientation, the Minnesota Multiphasic 
Fig. 108 Profile on Kuder Preference Record—Vocational. 
(Reproduced by permission of Science Research Associates.) 
Fig. 109 Report form of Strong Vocational Interest Blank, showing 
mean scores of a group of medical students. (Reproduced by permission 
of Stanford University Press; data from Strong, 76, p. 421.) 

The Work of the Counseling Psychologist 


Personality Inventory (41) has been used extensively in counseling normal 
persons and a considerable amount of relevant research has accumulated. 
On the basis of data obtained with this test from more than four thousand 
college students examined in a college counseling center, Drake and Oetting 
prepared a codebook for interpreting MMPI profiles in this context (24). A 
few self-report inventories have been specially developed for normal persons, 
and their scores are expressed in terms of needs, motives, and interpersonal 
response modes rather than in terms of pathological deviations. Examples of 
such inventories include the Edwards Personal Preference Schedule (26), the 
California Psychological Inventory (35), and the Minnesota Counseling In- 
ventory (9). 

Integrating Tests into the Counseling Process. Under the influence of 
the nondirective, client-centered approach, counseling psychologists have put 
increasing emphasis upon the counseling process itself as a means of con- 
tributing to the client’s development. In keeping with this viewpoint, several 
psychologists have given considerable attention to the way in which tests are 
introduced into the counseling process—how the tests are selected, when 
and how they are administered, and how the results are communicated to 
the counselees (13, 33, 83). Some feel that what tests are chosen and how 
the individual scores on them are less important questions than how the tests 
are incorporated into the counseling process. 

Taking tests is itself regarded as a significant experience for the counselee.
It is not to be thought of as an independent activity, but as a part of the total 
counseling experience. As such, it is a kind of “reality testing,” providing the 
counselee with an opportunity to check his aspirations, plans, and self concept 
against objective standards. The individual's attitude toward taking tests may 
also be quite revealing and may in turn affect his test performance. One 
student may feel threatened if he believes that the decision to enter the college 
of his choice hinges on his test score. Another may hope that a low test score 
will help to dissuade an overambitious father from pushing him into a voca-
tion for which he has neither interest nor aptitude.

Unlike test scores obtained in other situations, the information obtained 
from counseling tests is primarily for the use of the client himself. One excep- 
tion is the occasional administration of preliminary tests to decide whether 
the client should be retained in counseling or referred elsewhere. In general, 
counseling tests should serve to stimulate client interest in areas or courses 
of action he has not previously considered and should provide him with the 
information he needs to make decisions. Tests are not a substitute for decision 
making by the client but should help to clarify the alternatives among which 
he must choose. Tests should therefore be introduced to answer specific 
questions or solve problems presented by the client. 

It should also be noted that test results are more helpful in making negative 
than in making positive decisions. It is easier to determine on the basis of 
test scores which occupations or courses of study to rule out than which to 
elect. This follows from the fact that a certain minimum aptitude, interest
pattern, or whatever the test measures is a necessary but not a sufficient 
condition for success in a given area. Individuals lacking such a qualification 
are likely to fail. But those who meet the requirement are not assured of 
success, since the latter depends on many additional personal and circum- 
stantial factors. 

In nondirective counseling, the client frequently participates in the selec-
tion of tests (13, 14, 33, 66, 83). Some counselors introduce no tests at all
until and unless the client requests them. Others regularly employ tests but 
ask the client to choose those he would like to take from a given list. This 
technique is said to work better than one might anticipate, especially if cer- 
tain cautions are observed. Most counselees lack the technical competence to 
evaluate tests. Hence the list from which a choice is made should contain 
only tests that are appropriate for the client and that have satisfactory reliabil- 
ity, validity, and norms. A simple explanation of the type of information 
each test provides may be given orally by the counselor or included in the 
list. Another approach is to have the client specify the kind of question he 
would like answered about himself, whereupon the counselor chooses the 
specific test for that purpose. 

A possible difficulty may arise from misconceptions about the purpose 
of tests and incorrect notions about their nature, which many counselees have 
acquired from popular sources. The counselee may say he wants to take a 
test "to find out what kind of work I should do" or “to find out if I'm 
normal." Again, he may want to take tests as a means of evading a discussion 
of his real problem or for some other defensive reason. In all these cases, 
the counselor needs to pursue the matter sufficiently and possibly to provide
appropriate information to cut through the superficial responses and enable 
the counselee to make a more intelligent decision about test taking. 

On the positive side, it has been argued that client participation in test 
selection has many advantages over a completely counselor-directed program 
(13, 33). If the client cannot see the relevance of the tests to his immediate 
problem, he may simply drop out of counseling. The same result may follow 
if the client is not emotionally ready to submit to a realistic appraisal of him- 
self. When the client is convinced of the need for testing and understands its 
purpose, on the other hand, he will be better motivated to do his best on 
ability tests and to answer truthfully on self-report inventories. He will also 
be more likely to accept test findings, rather than reacting defensively toward
them. 

The client's own reaction to information about a list of tests from which 
he must choose may also shed light on attitudes and other relevant facts. For 
example, the counselor might learn that a student is overanxious about school 
examinations or that he has a hobby indicating considerable mechanical apti- 
tude. Finally, participation in test selection gives the client an opportunity to 
practice decision making. 

As in many other interpersonal relations, the most effective degree of 
client participation in test selection depends upon the interaction of client, 
counselor, and situational variables. The first includes such client character- 
istics as age, intelligence, sophistication about tests, and emotional maturity; 
the second includes the counselor’s personality characteristics, theoretical orien- 
tation, and attitudes toward directive and nondirective techniques; the third 
includes the context (school, community agency, hospital, etc.), the nature 
of the problem (educational, vocational, personality, etc.), the time available, 
and similar questions. The importance of these interactions is one of the few 
clearly established findings of research on different methods of selecting tests
for counseling (33, 83). There is apparently no one best way for all circum-
stances.

The way in which test findings are communicated to the client is another
important consideration in the counseling process. The counselor must take 
into account what the test score will mean to the client in the light of the 
latter's individual attitudes, goals, anxieties, and conflicts. He must be ready 
to detect expressions of rejection, hostility, and rationalization toward the test 
scores on the client's part. When such reactions occur, they must be accepted 
as one more fact regarding the client's behavior. 

Test findings need not be reported in the form of scores, since these may 
be misunderstood by a psychometrically unsophisticated client. Rather they 
should be stated as expectancies or probabilities of concrete events, such as 
graduation from a given college, and should be directly related to the ques- 
tions the client set out to answer. Test results should not be introduced until 
the client is ready to discuss them. Even then, the discussion should remain 
sufficiently flexible so that tangential topics brought up by the client may be 
followed up. The client should also be encouraged to integrate test findings 
with other self-knowledge, such as his achievement in different academic 
courses, his job experiences, his interests in various activities, and the like. 
Such integration tends to dispel any “mystery” that attaches to psychological 
tests. When related to similar observations from everyday life, tests are more 
likely to be correctly perceived as behavior samples obtained under stand- 
ardized conditions. 
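
The practice of reporting expectancies rather than scores can be illustrated
with a brief sketch in Python. Everything in it is hypothetical: the score
bands, the counts of past students, and the function names are invented for
illustration and are not drawn from any study cited in this chapter. The
sketch only shows how follow-up records might be turned into a concrete
statement of the kind a client can readily understand.

```python
from bisect import bisect_right

# Hypothetical follow-up records for one college. For each score band we
# record how many past students entered and how many of them graduated.
# All figures are invented for illustration.
BAND_LOWER_BOUNDS = [0, 40, 50, 60]   # lower edge of each score band
ENTERED = [50, 120, 150, 80]
GRADUATED = [10, 54, 105, 68]


def expectancy(score):
    """Proportion of past students in the same score band who graduated."""
    if score < BAND_LOWER_BOUNDS[0]:
        raise ValueError("score below the lowest band")
    band = bisect_right(BAND_LOWER_BOUNDS, score) - 1
    return GRADUATED[band] / ENTERED[band]


def describe(score):
    """Phrase the expectancy as a concrete statement instead of a score."""
    chances = round(expectancy(score) * 10)
    return f"About {chances} in 10 students with similar scores graduated."


print(describe(55))
```

A statement of this form ties the test result directly to the question the
client set out to answer, without requiring him to interpret a standard score
or percentile.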

The counselor in a school or college has the opportunity to accumulate 
longitudinal data about each counselee over a period of several years. It is 
customary in such situations to keep a cumulative record for each individual, 
including test data, biographical information, and continuing observations of 
his reactions toward different teachers, courses, extracurricular activities, and 
other school experiences. Under these circumstances, the counselor himself 
has usually had repeated contacts with the individual over a long time span. 
When properly employed, the cumulative record should prove extremely help- 
ful. It can provide a wealth of information and a picture of developmental 
trends that no cross-sectional counseling contacts could equal. In practice, 
however, there is a danger that the keeping of cumulative records may become 
just one more mechanical chore in a busy schedule. The use of complicated 
and rigid forms may detract from the spontaneous recording of significant 
observations and the meaningful interpretation of developmental trends. 

Another potential advantage available to counselors in school systems lies 
in their opportunity to do certain kinds of research. Because they have access 
to large numbers of students who go through similar criterion situations, such 
counselors are often in a better position than clinicians to conduct applied 
research with data already in their files. For example, entering students can 
be followed up through four years in the same college. The validity of tests 
and other predictors can thus be investigated, and empirical norms and cutoff 
scores can be established. Similar studies can be conducted with graduates
of a single school who have entered a particular college in large numbers
over a period of years, or possibly with groups who have been employed in a
single community industry.

Finally, the counselor should accumulate information regarding the criterion 
situations about which his clients must make predictions. In educational coun- 
seling, for example, he must keep up with current data regarding schools and 
colleges in which his clients may be interested. Routine information regarding 
formal requirements, curriculum, financial aid, and the like, can be found in
school and college catalogues and in a number of standard handbooks (e.g.,
20, 56, 57).

There are many other characteristics of schools and colleges, however, that 
should be considered by a client. For example, it is well established that the 
average level and range of scholastic aptitude test scores vary widely among 
colleges. The differences are so large that there is little overlap between the 
top and bottom colleges in these scores. Colleges also differ in their psychologi- 
cal climate and in the kinds of environmental pressures they put upon the 
student. Research has revealed differences among colleges in such characteris- 
tics as relative strength of theoretical and practical values, emphasis placed 
upon social conformity and acceptance by others, extent of extracurricular 
participation, degree of student concern with social action and international 
problems, and pressure to achieve as contrasted to an easygoing atmosphere 
(62, 63, 73). Quite apart from his intellectual qualifications, a student might
be a misfit in one of these college environments and an outstanding leader in
another. 

In vocational counseling, knowledge about jobs is of such basic importance 
that it has received special attention from the earliest beginnings of the 
vocational-guidance movement. This type of information will be considered 
in the next section. 


OCCUPATIONAL INFORMATION 


Because vocational decisions play a prominent part in the counseling process 
in schools, colleges, community agencies, Veterans Administration hospitals, 
and other contexts, the counseling psychologist must have ready access to 
dependable sources of occupational information. The scope and complexity of 
the modern world of work necessitate special efforts to organize and system-
atize such information.

Vocational information includes knowledge about available fields of work
and types of jobs, nature of the work, worker qualifications, training require- 
ments, pay rates, working schedules and hours, vacation provisions, retirement 
plans and other fringe benefits, characteristics of the working environment, 
and many other specific facts of interest to the potential worker. It involves 
knowledge about national trends as well as local conditions. Obviously, no 
one person could acquire all this information. Even if he could do so, more-
over, his store of knowledge would begin to go out of date at once. What the
vocational counselor tries to do instead is to become familiar with sources of 
occupational information and with methods for evaluating it, keeping it up 
to date, and making it available to clients. Available sources of occupational
information have been surveyed in a number of books (e.g., 5, 36, 43, 70).
Such books are frequently used as texts in occupational information courses for
counseling psychologists.


Counseling centers in schools, colleges, community agencies, and other 
settings generally maintain a library and file of vocational information materials 
for use by counselors and clients. Such collections include reference books
and journals, as well as monographs, pamphlets, and leaflets dealing with
specific fields, industries, or companies. Some texts on vocational informa-
tion devote a whole chapter just to the question of how to classify and file
these vocational materials so that they can be most readily located when
needed (e.g., 43, ch. 6). To keep their information up to date, counseling
centers also subscribe to publication services which provide fresh material 
periodically. 

Because of the tremendous quantity and uneven quality of occupational 
literature, the counselor needs to be selective. Some years ago, the National 
Vocational Guidance Association (61) prepared a set of standards for evaluat- 
ing an occupational monograph or pamphlet. The principal requirements are 
recency, objectivity, and readability. Owing to the rapid obsolescence of voca- 
tional information, the date when the information was obtained is a major 
consideration. Objectivity can often be evaluated by examining the source of
the publication. Government agencies, such as the Department of Labor, as 
well as private publishing firms that prepare occupational materials, are usually 
motivated only to provide information. A specific industry or company, on 
the other hand, is likely to present a more biased picture because of its inter-
est in recruitment. Popular sources are frequently concerned with the enter-
tainment value of their material and may therefore give dramatized or glamor-
ized versions. This does not imply, of course, that vocational material prepared
by a popular magazine or by a company or industry is necessarily biased.

In publications to be used by clients, it is desirable that the information
be interestingly presented and readily understood by the type of person for 
whom it is intended. By and large, occupational information materials tend 
to be too difficult for their readers. In one investigation, the Flesch formulas 
described in Chapter 12 were used to measure the reading ease and human 
interest of 78 occupational information booklets from a wide variety of govern- 
ment and private sources (16). Nearly two-thirds were rated as "very difficult," 
or at the level of a typical scientific magazine; another 32 per cent were rated 
as "difficult." With regard to human interest, about two-thirds fell into the 
"dull" category and about a third into the "mildly interesting." 
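
For readers who wish to see how such a readability rating is computed, the
following Python sketch applies the widely published Flesch Reading Ease
formula and Flesch's conventional difficulty bands. It is a minimal sketch and
not a reproduction of the procedure used in the study cited above. The function
names are hypothetical, and the word, sentence, and syllable counts are assumed
to have been tallied beforehand from a sample of the booklet.

```python
def flesch_reading_ease(words, sentences, syllables):
    """Flesch Reading Ease score computed from raw counts in a text sample."""
    if words <= 0 or sentences <= 0:
        raise ValueError("words and sentences must be positive")
    average_sentence_length = words / sentences
    syllables_per_word = syllables / words
    return 206.835 - 1.015 * average_sentence_length - 84.6 * syllables_per_word


# Lower bound of each band, from easiest to hardest (Flesch's usual labels).
BANDS = [
    (90, "very easy"),
    (80, "easy"),
    (70, "fairly easy"),
    (60, "standard"),
    (50, "fairly difficult"),
    (30, "difficult"),
]


def difficulty_label(score):
    """Map a Reading Ease score to its descriptive category."""
    for lower_bound, label in BANDS:
        if score >= lower_bound:
            return label
    return "very difficult"


# Invented sample: 100 words in 4 sentences containing 190 syllables.
sample_score = flesch_reading_ease(words=100, sentences=4, syllables=190)
print(round(sample_score, 2), difficulty_label(sample_score))
```

Long sentences and polysyllabic words both lower the score, which is why
technical occupational booklets so often fall into the two most difficult
categories.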

Among the general sources of occupational information available to the 
counselor, an outstanding publication is the Occupational Outlook Handbook, 
prepared by the United States Bureau of Labor Statistics (85). This book 
summarizes trends in the number of persons employed in different fields. For 
example, in recent decades the proportion of persons in the professions has 
been steadily increasing, as have the proportions in clerical and sales jobs. 
At the same time, the proportion in unskilled labor jobs has been decreasing, 
as machine production and automation have taken over more and more of 
their functions. The Handbook also contains detailed information regarding 
several hundred major occupations, chosen with special reference to the inter- 
ests of students and other young people seeking guidance. The data given for 
each occupation include nature of the work, number and nature of workers 
employed, where employed, training and qualifications, lines of advancement, 
employment trends and outlook, earnings and working conditions, and where 
to go for further information. The presentation is sufficiently simple for a 
junior high school student. To keep the material up to date, the Handbook 
is revised biennially. In addition, supplementary information is provided in 
a periodical, the Occupational Outlook Quarterly.

One of the major contributions to the organization and systematization of 
occupational information was the preparation of the Dictionary of Occupa-
tional Titles (DOT) by the United States Employment Service (87). The
DOT provided for the first time a comprehensive classification of jobs, with 
definitions of each job title and a uniform numerical coding system. Volume 
I defines each of about twenty-two thousand job titles in terms of the duties
performed. To obtain this information, job analysts observed samples of 
workers in many different jobs in a nationwide survey. Supplementary in- 
formation was gathered from such sources as libraries, employers, trade and 
labor organizations, and public employment offices. Some eighteen thousand 
alternate titles for the jobs are also listed and cross-indexed. All titles in volume 
I are arranged alphabetically. In volume II, on the other hand, the same titles 
are arranged according to an occupational classification, with similar jobs 
grouped together. 

In both volumes, all job titles are accompanied by their code numbers. 
Hence a counselor or client can look under the appropriate codes in volume 
II for jobs in which he might be interested and can then turn to volume I for 
the corresponding job descriptions. The code numbers include five or six digits.
Of these, the first digit denotes the major field, or level, such as professional
and managerial, clerical and sales, skilled labor, and unskilled labor. Successive
digits refer to increasingly finer subdivisions. For example, the digit 1 in the
first place designates the broad category of clerical and sales occupations; 1-01 
represents “bookkeepers and cashiers, except bank cashiers”; and 1-01-05 cor-
responds to “budget clerk.” Similarly, 4-78-021 is the code number for a
skilled turret lathe operator. Another volume of the DOT is concerned with 
the classification of entry occupations. These are jobs that require no experi-
ence and through which an individual may enter a field of work (86).
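
The hierarchical character of these code numbers can be made concrete with a
short Python sketch. It assumes that codes are written with hyphens as in the
examples above, and it uses only the three titles quoted in the text. The
lookup table and the function names are hypothetical and serve only to show
how each successive group of digits narrows the classification.

```python
# Titles taken from the examples in the text. A full table would hold
# thousands of entries; this one is deliberately tiny.
CODE_TITLES = {
    "1": "clerical and sales occupations",
    "1-01": "bookkeepers and cashiers, except bank cashiers",
    "1-01-05": "budget clerk",
}


def code_levels(code):
    """Return the prefixes of a code, from broadest to most specific."""
    parts = code.split("-")
    return ["-".join(parts[:i]) for i in range(1, len(parts) + 1)]


def describe_code(code):
    """Pair each level of a code with its title, where the table has one."""
    return [
        (prefix, CODE_TITLES.get(prefix, "(not in this sample table)"))
        for prefix in code_levels(code)
    ]


for prefix, title in describe_code("1-01-05"):
    print(prefix, "=", title)
```

Reading a code from left to right thus retraces the path from the major
occupational field down to the individual job title.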

Although the publication of the DOT did much to introduce order and 
uniformity in occupational information, its classification system is crude and 
deficient in a number of ways. Some alternative classifications will be con- 
sidered in Chapter 17. With regard to the DOT itself, the development of a 
new schema of classification has been in progress for several years and is 
expected to replace the original coding system in 1964 (27-29). By means 
of a nine-digit code, this system will classify occupations on the basis of worker 
functions (reflecting trait requirements), work field, and product or service. 
To date, the system has been applied to a list of 4,000 jobs (84). The in-
formation given for each job includes general educational requirements, spe-
cific vocational preparation, aptitudes, interests, temperamental characteristics,
physical capacities, and special working conditions (e.g., cold, heat, noise). 
The estimated trait requirements are based on ratings by job analysts. Aptitude 
patterns on the General Aptitude Test Battery are also included when avail-
able.

As in the case of psychological tests, we may inquire into the ways in which 
occupational information can be incorporated into the counseling process (see
42, chs. 10-12; 83, ch. 8). Like tests, occupational materials can be most
effectively introduced when a specific need for them arises, to answer questions 
raised by the client. Generally, however, it is better to postpone the considera-
tion of occupational data until after some preliminary interviewing and pos-
sibly testing. Otherwise, a detailed examination of a specific field of occupation
may lead to a premature structuring of the client's problem around vocational 
questions or to a premature vocational choice. 

Vocational information may serve several purposes in the process of voca-
tional planning. One is the exploration of the world of work. The client may
be looking for ideas about fields in which he would be interested and may 
want a general picture of the work performed in various jobs. It is in such 
cases that counselors often turn to comprehensive sources like the DOT and 
the Occupational Outlook Handbook to help the client investigate likely 
fields and identify jobs not previously familiar to him. A second use of voca- 
tional information is to aid the client in choosing among specific occupations. 
For this purpose, the client needs fuller information regarding the alternatives 
he is considering. Such information should not only cover obvious job charac-
teristics but should also give some notion of the total way of life that a particu-
lar vocational choice implies. Finally, occupational information may be re-
quired to implement a vocational decision. For example, the client may need
information about apprenticeships and union regulations in a skilled trade, 
about the educational requirements for law or medicine, about the offerings 
of different graduate schools in his chosen field, or about the availability of 
scholarship aid in various colleges. A single client may require occupational 
information at any one or more of these stages in his vocational planning. 

As for the method of presenting the information, the counselor may trans- 
mit the required data orally, refer the client to published sources, or combine 
the two procedures. Noting the client's reaction to the information and reflect- 
ing the client's feelings are important aspects of the counselor's contribution in 
this connection. Many supplementary methods can be employed to facilitate 
and augment the information-gathering process (see, e.g., 43). Schools fre- 
quently conduct regular courses on occupations, which may utilize several 
techniques of group guidance. This approach is helpful in providing a pre- 
liminary overall view of the world of work and in encouraging the student 
to think about relevant questions. Other procedures include group conferences 
with representatives of different fields, plant tours, and part-time or summer 
jobs. Ideally, techniques such as these should supplement individual counsel- 
ing. If used alone, they may yield an inadequate or distorted picture of the 
occupations surveyed. 


EVALUATING THE EFFECTIVENESS 
OF COUNSELING 


Test Validation Studies. Research on the effectiveness of counseling has 
been concerned with several different aspects of the counseling process. Any 
data on the predictive validity of the tests employed in counseling are of 
course relevant. Several longitudinal investigations have analyzed the relation 
between test scores and later vocational choices or indices of vocational adjust- 
ment, when the tests were not administered in connection with counseling. 
An unusually extensive study of this type was conducted by Thorndike and 
Hagen (81). In 1955 and 1956, these investigators sent a questionnaire to 
about seventeen thousand men who in 1943 had taken a comprehensive test 
battery when applying for cadet training in the United States Air Force. 
Approximately ten thousand responded. For purposes of analysis, the tests were 
grouped into five categories: general intelligence or scholastic aptitude, numeri-
cal fluency, visual perception, mechanical ability, and psychomotor skills. Re-
sponses to a large number of items in a specially devised biographical in-
ventory were also available. Initial test scores and biographical inventory
responses were first analyzed in terms of the individual's present job. Within 
each job group, the predictors were then correlated with each of seven criteria 
of job adjustment, such as earned income and self-ratings for job success and 
for job satisfaction. 

Among the 124 occupational categories analyzed, several could be clearly 
differentiated in terms of level and pattern of test scores, as well as in responses 
to biographical inventory items. For example, accountants and auditors 
averaged very high in numerical fluency and high in general intelligence; they 
were about average in visual perception, below average in psychomotor skills, 
and very low in mechanical ability (Figure 110). Architects were distinctly 
high in visual perception; carpenters scored higher in mechanical ability than 
in general intellectual tests. To be sure, individual differences in scores within 
each occupational group were large, and overlapping among groups was con- 
siderable. Moreover, there were a number of occupations that yielded flat 
profiles with no outstandingly high or low scores. Nevertheless, the results show 
considerable correspondence between the type of work in which these men
were employed and their test performance some thirteen years earlier. 

Neither test scores nor biographical inventory items, on the other hand, 
proved to have any validity in predicting degree of job success within any one 
occupation. The correlations with the various criteria of job adjustment were 
no different from what would be found by chance. 

Despite its ambitious scope, this study has many limitations, which its 
authors describe fully. As a result of self-selection as well as preliminary 
screening by the Air Force, the available sample was clearly unrepresentative 
of its age group. The proportion of nonrespondents leaves the way open for
further bias in the final sample. The number of persons in single occupational
groups was often quite small. Even when similar occupations were combined 
and the smallest groups eliminated, nearly half of the remaining categories 
contained fewer than fifty cases. At the same time, the men classified within 
each group were generally heterogeneous with regard to nature of work per-
formed, even when their occupation was nominally the same. Finally, it is
obvious that the criteria of job adjustment were very crude. Most of these 
limitations are such as to reduce the likelihood of finding significant correla- 
tions between predictors and criterion measures. The differentiation of voca- 
tional groups in terms of initial aptitude patterns, on the other hand, was 
sufficiently pronounced to emerge despite these methodological handicaps.

It should be noted, furthermore, that other longitudinal studies of narrower
scope have obtained very similar results (see 83, pp. 118-120). Both voca-
tional and educational groups can be differentiated in terms of their initial
score patterns on aptitude and interest tests. But, within such groups, the tests
generally show little relation to degree of success. Even when scores from
different tests are weighted and combined in such a way as to simulate the 
sort of judgments a counselor might make in predicting a client's suitability 
for a particular vocation, no significant correlation is found between these 
predictions and indices of job success (53). 

The fact that tests can differentiate among occupational and educational
groups probably reflects the selective processes that normally accompany ad- 
mission and survival in these groups. Through self-sorting, screening, and 
subsequent dropping out, persons who are unsuited for a particular course of 
study or job are unlikely to remain in it long. It is the purpose of tests, of 
course, to reduce the waste in this natural selection by steering persons away 
from unsuitable choices in advance. Among those who meet the minimum 
prerequisites for survival in an educational or vocational category, however, 
the degree of success or satisfaction attained depends upon such a complex 
interplay of personal and situational variables that test scores alone are of 
little help in predicting it. This conclusion is supported by the common find- 
ing that negative prediction from test scores can be made with more accuracy 
than can positive prediction (30). 

An ongoing nationwide survey of high school students may eventually pro- 
vide more definitive answers to the sort of questions investigated by the above 
studies. Known as Project TALENT, this research began in 1960 with the 
administration of a comprehensive battery of aptitude and achievement tests, 
as well as interest and personality inventories, to approximately 440,000 high 
school students (31). Information about each school and its guidance program 
was also obtained. In comparison with earlier studies, this project provides a 
wider coverage of tests and a larger and more representative sampling of sub- 
jects. The group was carefully chosen so as to yield a stratified sample of 
students enrolled in public, parochial, and private high schools throughout the 
country. In addition, in order to secure national norms for one complete age 
group, fifteen-year-olds not in high school were also tested. Present plans call 
for questionnaire follow-ups to be conducted 1, 5, 10, and 20 years after high 
school graduation. 

Use of Tests in Counseling. Some investigations have been concerned 
more directly with the effect of using tests in the counseling process. In these 
studies the results are generally more promising than when tests were
administered without counseling. Indices of job success and job satisfaction do
show significant relations with predictions made from test scores under these 
conditions (see 83, pp. 116-117, 273-277). 

Typical of these studies is a British investigation conducted in the 1940s 
(45). A total of 1,639 young people who were leaving school at the age of 
fourteen to go to work were followed up after a two-year interval, and an 
additional 603 were followed up after four years. At the time of school leaving, 
half of the entire group was counseled on the basis of test information. The 
other half had only a vocational planning interview, which was standard 
procedure for school leavers. The jobs subsequently held by all counselees were 
classified into "accordance" and "non-accordance" posts depending upon
whether they were or were not consistent with the counselor's
recommendations. A two-way comparison could thus be made, between tested and
nontested counselees and between those who had and those who had not followed
the counselor's advice. The criteria of occupational adjustment employed in this
analysis were based on estimates of suitability for the job, secured from the 
subjects and from their employers. 

Consistent results were obtained with all criteria and in both two-year and 
four-year follow-ups. The counselees who had been tested made better job 
adjustments than those who had not. Within the tested group, those who were 
in accordance jobs were more successful than those in non-accordance jobs, but 
this was not true in the group counseled without tests. Furthermore, tested 
counselees who began in non-accordance jobs tended to shift in the direction of 
accordance jobs, whereas in the nontested group shifts tended to be away from 
counselor recommendations. 

This study and others following the same general procedures clearly indicate 
the value of tests in vocational counseling. That these results are more favor- 
able than those of follow-ups based on test scores alone may reflect the influence 
of the counseling process itself. In the interaction between counselor and client,
other relevant factors can be integrated with test scores in choosing a suitable
course of action. Moreover, the individual's participation in the counseling
process itself may have facilitated self-insight and personal development in
ways that were beneficial to subsequent vocational adjustment. Tests provide 
opportunities for reality testing that may contribute to the effectiveness of this 
experience. 

Attainment of Self-knowledge. The emphasis placed by the above studies 
on whether or not the counselee followed the counselor's recommendations is 
characteristic of the earlier, more "directive" approach to counseling. The
recent swing toward nondirective and client-centered techniques is reflected in a
corresponding shift in research. Studies conducted since the 1950s have con- 
centrated more heavily on the effect of different aspects of the counseling 
process upon self-knowledge (33, ch. 14; 83, ch. 7). A common procedure is to 
compare test scores in abilities or interests with self-ratings obtained before and 
after counseling or in a counseled and a control group. A smaller discrepancy 
between test scores and self-ratings or between counselor's and client's ratings 
is regarded as evidence of better self-insight.
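
The discrepancy index described above can be illustrated with a minimal
sketch in Python. The scores, the common percentile scale, and the function
name are hypothetical and are not taken from any of the studies cited; the mean
absolute difference is only one of several indices that could be used.

```python
def mean_abs_discrepancy(test_scores, self_ratings):
    """Average absolute gap between measured scores and self-ratings.

    Both lists are assumed to be on the same scale (here, percentiles)
    and to list the same counselees in the same order.
    """
    if len(test_scores) != len(self_ratings):
        raise ValueError("score lists must have the same length")
    gaps = [abs(t - s) for t, s in zip(test_scores, self_ratings)]
    return sum(gaps) / len(gaps)


# Hypothetical data for four counselees (percentile scale).
test_scores = [80, 45, 60, 30]
ratings_before = [50, 70, 60, 55]   # self-ratings before counseling
ratings_after = [70, 50, 65, 40]    # self-ratings after counseling

before = mean_abs_discrepancy(test_scores, ratings_before)
after = mean_abs_discrepancy(test_scores, ratings_after)
improved = after < before  # a smaller gap is read as better self-insight
```

With these invented figures the mean discrepancy falls from 20.0 to 7.5, which
under this approach would be taken as a gain in self-knowledge.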

There are a number of methodological weaknesses in this approach. The 
adequacy of counselor evaluation or test scores as a criterion for measuring 
accuracy of self-knowledge is certainly questionable. Furthermore, self-ratings 
obtained after the counselee has been told his test scores or his counselor's
evaluations may be contaminated by this knowledge. The individual may be 
simply parroting what he has heard without necessarily revising his self 
concept. On the other hand, if counseling fails to bring about the expected 
evidence of improved self-knowledge, it is impossible to determine why. It 
may mean that the tests or the counselor's evaluations were in error, that the 
counselee was poorly motivated to profit from counseling, that he rejected the 
knowledge because of defensiveness, or that the counselor did not communicate 
the information effectively. In view of these various methodological difficulties, 
it is not surprising that this type of research has so far yielded inconsistent 
and inconclusive results. 

Global Studies of Counseling Effectiveness. Other studies have been 
designed to check the over-all effectiveness of educational or vocational counsel- 
ing in terms of follow-up criteria (see 83, ch. 13). Although the findings of 
different studies are not entirely consistent, they generally show that the 
counseling of high school and college students has a favorable effect upon their 
subsequent academic and vocational adjustment. Such generalizations, of 
course, need to be qualified in terms of counselor characteristics, type of 
counseling, and nature of client problems. Several studies suggest, for example, 
that college students with marginal scholastic aptitude are unlikely to improve 
in academic achievement as a result of the usual counseling procedures (see 83, 
pp. 271-272). 

One of the most extensive and carefully controlled investigations of counsel- 
ing effectiveness was the Wisconsin Counseling Study (68). This study was 
concerned with the effects of a long-range guidance program rather than with 
those of a single counseling experience. In a research project extending over 
8 years, a guidance program was established in four representative Wisconsin 
high schools. Participating students were followed up 6 months, 2½ years, and
5 years after graduation. 

Studies that compare self-selected counselees with non-counseled control cases 
are subject to the unknown influence of motivational differences between the 
two groups. Even when the counseled and control subjects actually compared 
are matched a posteriori for a number of variables, there is no control over 
motivation to improve and tendency to use effective methods in solving life 
problems. It is likely that the group that seeks counseling spontaneously excels 
initially in these respects. This common methodological difficulty was avoided 
in the Wisconsin study by assigning students at random to the experimental 
and control groups. A total of 870 sophomores in the four high schools were 
placed in the two groups. Those in the experimental group were counseled 
during the remaining three years of high school by members of the university 
staff, who used generally accepted techniques of testing and interviewing. The 
control subjects received no counseling.
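
Random assignment of the kind just described can be sketched briefly in
Python. This is an illustration only, not the procedure actually used in the
Wisconsin study: the student numbers, the seed, and the equal split between
groups are assumptions introduced for the example.

```python
import random


def assign_groups(student_ids, seed=None):
    """Shuffle the students and split them into two groups by chance alone.

    Because assignment ignores every characteristic of the student,
    motivation and similar variables are expected to balance out
    between the groups on the average.
    """
    rng = random.Random(seed)
    shuffled = list(student_ids)
    rng.shuffle(shuffled)
    half = len(shuffled) // 2
    return shuffled[:half], shuffled[half:]


# 870 sophomores, as in the text; the ID numbers themselves are hypothetical.
students = range(1, 871)
experimental, control = assign_groups(students, seed=1960)
```

The essential point is that no student, parent, or counselor decides who is
counseled; self-selection is thereby ruled out as an explanation of later
group differences.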

The results were consistently favorable to counseling, although the differ- 
ences between counseled and control groups were not large. In comparison 
with the control subjects, the counselees achieved slightly better academic 
records both in high school and in subsequent education, were more realistic 
about their weaknesses, were less dissatisfied with their high school
experiences, were more consistent about their vocational choices, made better
progress in their chosen employment after graduation, and were more likely to
continue their education to high school graduation and beyond.

A somewhat different application of counseling is illustrated by the
Demonstration Guidance Project conducted in New York City (96, 97). The object of
this project was to identify promising pupils from a culturally deprived neigh- 
borhood and to stimulate them to reach higher educational and vocational goals. 
Surveys have repeatedly shown that, even when adequate financial assistance 
is available, qualified high school students from lower socioeconomic levels are 
much less likely to go to college than are students from higher socioeconomic 
levels with the same intellectual qualifications (see 4, 42, 90). Moreover, 
achievement at the elementary and high school levels, as well as intelligence 
test performance, tends to be adversely affected by the attitudinal and
intellectual climate typically found in underprivileged homes. Children from such
homes not only have a dearth of intellectual stimulation but also have little 
incentive for educational advancement. 

The Demonstration Guidance Project was a pilot program aimed to
counteract these environmental deficiencies. The project was begun with about seven
hundred pupils in the seventh, eighth, and ninth grades of a junior high 
school and was continued for six years, thus giving the youngest of the three 
groups time to complete high school. The procedure included an intensive 
counseling program with both individual and group guidance; counseling and 
social work with parents; remedial instructional programs; and cultural
enrichment both in school and through field trips and visits to museums, art
exhibitions, theaters, concerts, and the like. Clinical services were also provided 
in many individual cases, and financial assistance was made available when 
needed. 

Although final results have not yet been fully analyzed, the data gathered 
so far appear promising. Questionnaire surveys of pupils, parents, and teachers 
indicate that for most students the program provided substantial help in their 
choice of career, encouraged them to continue in high school until graduation 
and to consider the possibility of college, broadened their interests and their 
participation in cultural activities, improved their social adjustment and their 
attitude toward school, and raised their self-esteem. The dropout rate of this 
group in high school was approximately 20 per cent, as compared with about 
40 per cent for other classes. Of particular interest are the marked
improvements in educational achievement and the mean rise of about 9 IQ points on a
verbal intelligence test, which is significant at the .01 level. The latter increase
is all the more notable when contrasted with the decline in IQ with age 
usually found in culturally deprived groups. With regard to the various
influences operating during the program, the investigators comment that the
students' intimate relation with their guidance counselor seemed to be a major
factor in their improvement (96, p. 250). Because of the large number of 
independent variables and the absence of a comparable control group, however, 
it is impossible to draw causal conclusions from this study. 

Mention should also be made of investigations of the effectiveness of
personal counseling in both academic settings and elsewhere. Insofar as this type
of counseling is a form of psychotherapy, the research on the outcomes of 
psychotherapy discussed in Chapter 15 is also relevant here. Within the 
framework of counseling, we can repeat the conclusion reached from clinical 
research: so far, the effectiveness of counseling for personal adjustment has 
not been satisfactorily demonstrated (8; 83, chs. 11 and 13). The difficult 
methodological problems presented by both process and outcome research, 
which were considered in Chapter 15, make the designing of definitive studies
on personal counseling extremely difficult.

Theoretical Analyses of Counseling. At a different level of evaluation, 
psychologists, educators, and members of related disciplines have examined 
the goals, underlying assumptions, and philosophy of counseling. The rapid 
expansion of counseling activities and the increasing professionalization of 
counseling services since the mid-fifties have led to some searching analyses of 
the responsibilities of the counselor. Typical of such examinations of the 
counselor's role is Wrenn's The Counselor in a Changing World (93). Repre- 
senting the report of a commission appointed to consider guidance in Ameri- 
can schools, this book describes needed developments in school counseling in 
the light of projected changes in our culture. The author explores the implica- 
tions of societal changes for the functions and training of counselors. Of 
particular interest is the analysis of the “culturally encapsulated counselor,” 
who is unresponsive to both temporal changes and subcultural differences in 
the value systems he employs in his counseling practice. Furthermore, because 
counseling is typically concerned with the client’s plans for the future, it is not 
enough for the counselor to evaluate his information in terms of its present 
accuracy; he must also check it against rate and direction of change. 

Since discussions of the objectives of counseling reflect both the value 
systems and the theoretical orientations of the discussants, it is not surprising 
that the conclusions reached are quite divergent. This is one of the areas of 
current controversy and ferment alluded to in the opening of Chapter 13. The 
diversity of opinions held regarding the goals of counseling is richly illustrated 
in a series of articles published in 1962 in an issue of the Harvard Educational 
Review devoted to an examination of guidance (18). Dealing principally with 
the functions of the counselor within a school setting, these articles cover such 
questions as: the role of counseling in shaping and modifying behavior, rather 
than merely predicting behavior; the influence of both the immediate school 
context and the broader societal setting upon counseling goals and practices; 
the unwitting “smuggling” into the counseling process of values that may not 
be explicitly recognized by the counselor; the impact of counseling upon 
education and its place in the total educational process; and the issue of social 
conformity versus spontaneity and individuality as counseling goals. For an 
adequate appreciation of the wealth of provocative ideas provided in this 
symposium, the reader is urged to consult the original articles. 


SUMMARY 


Counseling psychologists work in many settings and perform a variety of func- 
tions. Although they may specialize in one type of counseling, such as voca- 
tional, educational, employee, rehabilitation, old-age, marriage, or personal coun- 
seling, fully trained counseling psychologists recognize the interrelation of all 
adjustment problems and focus on the counseling of the whole person. While 
there are many similarities between clinical and counseling psychology, the 
fields as a whole can be differentiated in terms of objectives, type of client, 
duration of professional relationship with client, extent of information giving, 
and orientation of practitioners. In none of these variables, however, is the
distinction so sharp as to preclude overlapping of the two fields.

Among the types of tests most commonly used in counseling are general
intelligence tests, educational achievement tests, special aptitude tests, multiple 
aptitude batteries, interest inventories, and personality inventories. Under the 
influence of the client-centered approach, counseling psychologists have placed 
increasing importance upon the ways in which tests are integrated into the 
counseling process. In counseling, psychological tests are used chiefly for the 
information of the client himself, to increase his self-knowledge and help him 
in making decisions. Some nondirective counselors leave the choice of tests up 
to the client. If applied within certain limits, this practice may make for a 
better counseling relationship and a more effective utilization of test results by 
the client. Counselors working in school systems generally have access to
longitudinal data about their counselees in the form of cumulative records. 

Because of the importance of vocational decisions in most counseling
activities, counseling psychologists need to be familiar with sources of
occupational information. An up-to-date and systematically organized library or file of
occupational materials should be available for the use of clients as well as
counselors. The development of the Dictionary of Occupational Titles by the
United States Employment Service introduced a comprehensive, uniform
classification and coding system for occupations, a classification which is now in
process of revision. Like psychological tests, occupational information needs to 
be integrated into the total counseling process. 

Research on the effectiveness of counseling has been concerned with the 
long-range predictive validity of counseling tests, the effect of using tests in 
the counseling process, the influence of different aspects of the counseling 
process upon self-knowledge, and the over-all success of counseling programs 
in terms of follow-up criteria. All of these questions present serious
methodological problems, only a few of which have been satisfactorily solved in
available research. Thoughtful analyses of the underlying philosophy of counseling
reveal wide diversity in value systems and theoretical orientation.

The most distinctive area of research in counseling psychology is that dealing
with occupational choices. What are the determinants of such choices? When 
and under what conditions are vocational decisions made? What kinds of
satisfactions do individuals seek in their work? How do these satisfactions
differ among persons who choose different occupations? That counseling
psychologists should focus upon occupations in their research programs is
understandable for a number of reasons. First, there is the historical association of
counseling with vocational problems. It will be recalled that the vocational
guidance movement was the principal factor contributing to the rise of
counseling psychology. Even today, when the scope of counseling has broadened
considerably, many counseling psychologists still function predominantly in the
vocational field. 

To be sure, the counseling psychologist typically considers vocational
problems within the context of the client's overall emotional adjustment. It is
recognized that the vocational problem presented by the client may itself arise
from emotional immaturity, insecurity, inability to face reality or make
decisions, a difficult familial situation, or other problems of personal adjustment.
Conversely, there is increasing realization that occupation itself plays a central
role in one's total life adjustment. Emotional problems frequently stem from
job tensions and frustrations, while absorbing and rewarding work can be highly
therapeutic. Similarly, maladjustments can often be prevented through
satisfying work activities. Not only the work itself, but also its many associated
experiences, can contribute to the individual's total adjustment. Of considerable
importance to mental health, for instance, are acceptance by fellow workers 
and the feelings of security and belongingness that come from identification 
with an occupational group. 


OCCUPATION AS A WAY OF LIFE 
Rationale. Choosing an occupation is equivalent to choosing a way of life
(5; 49, ch. 23; 58; 59, ch. 2). For one thing, the individual spends a large
proportion of his waking time on the job. But the influence of one's occupation
extends well beyond working hours. For the vast majority of people in our 
culture, the nature of their work is the single most important determiner of 
social status. When trying to "place" a new acquaintance, people ask, "What is 
Wilson?" and are told that he is a lawyer, or a carpenter, or a hairdresser. The 
income derived from work helps to shape one's general style of living—the 
neighborhood in which he lives, the sort of house or apartment he occupies, 
the car he drives, the kind of vacation he can afford. One's job determines the
distribution of work and leisure in both daily and seasonal schedules—as 
illustrated by the jobs of physician, long-distance truck driver, freelance writer, 
schoolteacher, and bank clerk. Some jobs limit the places where one may live 
—as illustrated by the jobs of forest ranger, mining engineer, music critic, and 
investment broker. In many other kinds of work, some geographical mobility 
is required for maximal vocational advancement. A research chemist or a pro- 
fessor of philology must be free to go where the best opportunities are available 
in his field of specialization. He may be a native Californian with all his family 
ties in the San Fernando Valley, but his best job offer may come from Harvard.

Occupations differ in many other ways that may be significant to an indi- 
vidual worker. They may influence one's habitual dress, as witnessed by our 
stereotypes of white-collar and blue-collar jobs and of the businessman in the 
gray flannel suit. They differ widely in the physical and psychological
surroundings in which the work is done. Contrast, for example, the neatness and
order of the normal working environment of a surgeon with that of a con- 
struction foreman or the interpersonal contacts of a bill collector with those of 
a commercial artist. Friendships are often work-determined. People tend to
spend their leisure hours with co-workers, business associates, or professional 
colleagues. Nor is it at all unusual for an individual to marry someone he or 
she has met through job contacts. 

Many kinds of work are associated with characteristic sets of values that 
frequently extend into one's personal life. If some of these values are not con- 
sonant with the individual's own values, the resulting conflict may be quite 
disruptive. In view of the degree to which vocational decisions imply the 
choice of a whole way of life, it becomes easier to understand the success of 
interest tests such as the Strong Vocational Interest Blank (5, 15). Persons in 
different occupations are in fact characterized by distinctive interest patterns 
and sets of values that reach beyond the work itself into nearly every aspect 
of daily living. 

It might be added that among women it is not only the so-called "career" 
women who need to be concerned about occupational differences. The job of 
housewife is by no means uniform. Much has been written about the peculiar 
problems and way of life of the wives of medical doctors, career officers in the 
armed services, and business executives. Although these examples have been 
widely publicized and dramatized, the same principle holds in all walks of life. 
The husband's occupation permeates and colors every facet of the wife's world 
in much the same way that it affects his own. 

From a different angle, the choice of an occupation can be seen as a way of 
establishing one's identity. Super (58-60) has repeatedly pointed out that 
vocational choice is the implementation of a self concept. He argues that "satis- 
faction in one's work and on one's job depends on the extent to which the 
work, the job, and the way of life that goes with them, enable one to play the 
kind of role that one wants to play" (58, p. 189). 

More specifically, occupations are chosen to meet needs (see 33, chs. 7 and 
8). The economic need to earn one’s livelihood is only one of many such needs. 
It can ordinarily be met by any one of many jobs, among which the individual 
chooses on the basis of other needs. For some persons, rate of pay may be so
closely linked with one or more of their major goals that it will be the determin- 
ing factor in a job decision. But, for most persons, other needs either determine 
or significantly affect vocational choices. Thus one person may put such a high
value on personal freedom and independence that he can only be happy as a
free-lance or self-employed worker. Another functions best when he is the
center of attention, a need that could be satisfied by many occupations,
ranging from actor to information clerk. Still another requires the security that
comes from working in a highly structured situation in which he is relieved of 
all responsibility for decision making. 

These needs are not necessarily verbalized (19). The individual may simply
feel that he must paint or that an office job is not for him, without being able
to explain why. It is one of the objectives of counseling to make the individual
aware of the motives underlying such convictions. When needs are recognized
as such, they are more readily brought under rational control and less likely to
lead to unrealistic decisions. Throughout such a process, however, the counselor
must operate within the framework of the client's needs, which may differ
from those of the counselor. This is one of the reasons why the counselor
cannot choose a "suitable" occupation for the client without at least some
interaction with the client. Parenthetically, the consideration of needs in vocational
decisions highlights one difference between the characteristic approaches of 
clinical and counseling psychologists. If, for example, a client has a deep-rooted 
feeling of insecurity, the clinician typically attempts to change the client's 
personality so that he may overcome the insecurity, whereas the counselor helps 
the client choose a vocation in which emotional insecurity is not a serious 
drawback. 

Occupational Personality Patterns. There is a mounting accumulation 
of research literature dealing with personality differences among occupational 
groups. The most comprehensive survey of studies published prior to 1956 is 
provided by Roe (49, chs. 13-20). Summaries of portions of this literature
prepared from different points of view can be found in several other sources (1,
ch. 13; 15, ch. 4; 30; 59, ch. 16; 62; 69, pp. 172-176). A number of studies
have appeared more recently and others are in progress (4, 12, 29, 31, 32,
36-39, 53, 55, 73). Some of this research has been conducted by psychologists,
although sociologists and members of other disciplines have contributed ex- 
tensively to this topic. The methodology ranges from rather subjective bio- 
graphical approaches to moderately well-controlled applications of question- 
naires, interviewing techniques, personality inventories, and projective tests. A 
wide variety of occupations has been investigated, including those of scientists 
in a number of fields, mathematicians, engineers, medical students, psychiatric
residents, schoolteachers, architects, and many others. Some investigators have 
compared student groups specializing in different fields, such as accounting and 
creative writing (52). Others have worked with persons employed in the 
various occupations; a few have concentrated on scientists who have attained a 
high degree of eminence in their fields.

Data on developmental history and childhood experiences of persons in
different occupations were generally obtained retrospectively during interviews
with the adult subjects. A notable exception is the longitudinal study of a 
group of California children with initial Stanford-Binet IQs of 140 or higher, 
conducted by Terman and his associates. Among the many analyses of the
voluminous follow-up data obtained in this project is a comparison of the men 
who eventually became physical scientists with those specializing in social 
science, law, or the humanities (64). Out of a total of about 500 items,
including test scores, ratings, and biographical data, 108 differentiated between
the two groups at the .05 level of significance or better. Most of these
discriminating items dealt with interests, including evidence of early scientific
interest in childhood and scores on the Strong Vocational Interest Blank.
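
A rough check on such a count is to ask how many items would be expected to
reach the .05 level by chance alone. The sketch below, in Python, is offered
only as an illustration: it treats the "about 500" items as exactly 500 and
ignores the correlations among items, neither of which is warranted by the
source.

```python
n_items = 500    # "about 500" in the text; taken as exact for illustration
alpha = 0.05     # significance level applied to each item
observed = 108   # items reported as differentiating the two groups

# If no real differences existed, roughly alpha * n_items items would
# still appear "significant" through sampling error alone.
expected_by_chance = n_items * alpha

# Number of significant items beyond the chance expectation.
excess_over_chance = observed - expected_by_chance
```

Under these assumptions some 25 items would be expected by chance, so that
the 108 observed exceed the chance figure by a wide margin. Which particular
items are dependable can only be settled by checking them on a new sample.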

The close association between vocational interests and other personality
characteristics has been suggested by a number of studies with the Strong
VIB (see 2, pp. 556-559; 15, ch. 4). An example is an investigation of 100
Air Force officers conducted at the University of California (see 15, pp. 128-
). Each subject took a battery of tests, including the Strong VIB, and also
underwent an intensive assessment program through interviews and other
observational techniques. On the basis of all available information, subjects
were described by eight clinical psychologists in terms of 76 given personality
variables. Correlations of these trait ratings with each of the VIB occupational
keys revealed a number of statistically significant correlations. The personality
descriptions associated with high scores on two of these keys are summarized
below:

High scorers on Mathematician Key: Self-abasing, concerned with philosophical
problems, introspective, lacking in social poise, lacking confidence in own ability,
sympathetic, reacts poorly to stress, not persuasive in personal contacts, not an
effective leader, not ostentatious, not aggressive or socially ascendant

High scorers on Real Estate Salesman Key: Self-indulgent, guileful, cynical,
opportunistic, aggressive, persuasive, ostentatious, may arouse hostility in others, not
sympathetic, not concerned with philosophical problems, not lacking confidence
in own ability, not self-abasing


Available published research on occupational personality patterns provides 
many promising hypotheses, but as yet few differences have been conclusively 
established. The number of cases investigated is often small. Few studies have 
included control groups. It is therefore impossible to determine whether the 
personality characteristics found in a given occupational group differentiate 
that occupation from others or whether they are common to men at a particular 
age level, who have attained a certain degree of vocational success, or who 
resemble the experimental subjects in other ways. For instance, in Roe’s studies 
of eminent scientists, the results pointed strongly to the importance of prestige 
motivation, strong inner drive, sustained effort, and absorption in work to the 
exclusion of other interests (47, 48). There is no way of knowing to what 
extent these qualities are also typical of men who have achieved distinction 
as artists, writers, business executives, statesmen, or workers in any other 
field. 

In studies that compared two or more occupational groups, the findings were 
rarely cross-validated on new samples to determine which of the observed
differences might have resulted from chance. Even when significant per- 
sonality differences are conclusively established between occupational groups, 
it is often impossible to analyze cause-effect relations. To what extent do sales- 
men become dominant and extroverted because of the demands of their jobs, 
and to what extent were they that way to begin with? Only longitudinal 
studies can adequately answer this kind of question.

At best, what occupational differences have been found in personality pat- 
terns represent group trends. For many kinds of work, one could probably 
write personality sketches that would roughly fit a majority of persons in that
occupation. This sort of information is useful in counseling insofar as it gives 
the counselee some idea of the kind of person with whom he is most likely 
to associate if he chooses a given type of work. But, within any occupational 
group, the range of individual differences is fully as wide in personality vari- 
ables as it is in aptitudes. Nor are these differences necessarily associated with 
different degrees of success. The breadth and flexibility of most occupations 
are such that different persons may succeed in them for different reasons. 
Individuals may choose specialties (as in law or medicine) that are consonant 
with their own needs, interests, and values. Even a single job can often be 
structured by the individual to fit his own aptitude and personality pattern. 
Moreover, the same job in different companies or under different supervisors
may call for very different personality traits. Specific jobs as well as occupational
fields undoubtedly vary in the degree of specialization and restructuring
that they permit. This is one of the variables that should be considered
in making vocational decisions. 

Job Satisfaction and Individual Needs. The studies cited in the preced- 
ing section were concerned chiefly with comparisons among occupational 
groups. Another approach is to investigate the correlates of job satisfaction 
within an occupation. For example, 72 men employed predominantly in pro- 
fessional and semiprofessional jobs filled out a questionnaire designed to assess 
the strength of 12 needs, such as those for recognition, achievement, self- 
expression, economic security, and independence (51). The subjects also rated 
their general job satisfaction and the degree to which their job permitted the 
satisfaction of each of the 12 needs. A multiple correlation of .58 was found 
between job satisfaction and the extent to which the individual's two strongest 
needs were satisfied by his job. 

In another study (8), 81 nurses were given several self-report inventories 
designed to assess, among other things, the individual's concept of her actual 
and ideal occupational roles, her self concept, and her vocational satisfaction. 
The results supported the hypothesis that, the larger the discrepancy between 
self concept and perceived occupational role, the lower the vocational satisfac- 
tion. Discrepancy between ideal and actual occupational roles was also nega- 
tively correlated with vocational satisfaction. 

There is some evidence that persons with a high achievement drive tend to 
prefer occupations with higher status or prestige. In a group of 53 young veterans,
a correlation of .74 was found between the individual's achievement
drive as assessed by a projective test and the prestige level of the occupation 
he chose (41). 

A series of studies conducted in Columbus, Ohio investigated more directly
the relation between job satisfaction and perceived job status (34, 45, 46).
The research was prompted by the difficulty of recruiting professional mental
health personnel for state institutions, as well as by the high turnover rate
among persons in these jobs. The principal sample consisted of 400 persons,
including 80 in each of the following occupations: psychiatrists, psychologists,
social workers, teachers, and nurses. In each occupational group, 40 persons
were employed in state institutions and 40 in other settings (clinics, agencies,
public schools, general hospitals, or private practice). There was also a control
group selected by area sampling of Columbus, Ohio.

By means of a series of rating scales filled out by the subjects, it was demonstrated
that the needs for recognition, status, and prestige are closely related
to job satisfaction among these professional workers. Discrepancies between 
expected and perceived job status were associated with low job satisfaction. 
Another common source of dissatisfaction was a discrepancy between wanted 
and available intellectual stimulation on the job. For all the professional groups 
studied, both perceived status and job satisfaction were found to be lower for 
institutional than for noninstitutional samples. These findings highlight the 
fact that need satisfaction may differ, not only among occupations, but also 
among work settings within the same occupation. 

The increasing emphasis upon the contribution of motivational factors to 
occupational choice and job satisfaction has led to the development of several 
instruments for measuring vocational needs. In an attempt to identify common 
factors or dimensions in the variables measured by these scales, Crites (14) 
administered three available inventories yielding 11 scores to 300 college students.
In these inventories, the subject rates types of work, ranks job incentives,
or records degree of liking for various items. A factor analysis of the
11 scores yielded five independent factors: material security versus job free- 
dom, personal status versus social service, social approval, system (self-created 
order and planfulness), and structure (externally imposed orderliness, work- 
ing on schedule, clearly defined duties). 

Although motivational factors undoubtedly play an important part in voca- 
tional adjustment, there is a danger that they may be overemphasized at the 
expense of performance factors. As vocational counselors have moved in the 
direction of personal counseling, with its more nearly clinical orientation, they 
have tended to lose sight of the realistic goal of job performance. Brayfield 
(72) has cautioned against this deemphasis of performance and has advocated 
a closer integration of industrial and counseling psychology. He writes, "We 
need the healthy antidote of a performance-oriented industrial psychology to 
balance the uncritical acceptance of the self-realization goal of vocational 
counseling" (72, p. 42).

Childhood Experiences and Occupational Choice. Early family experi- 
ences have been cited as a principal source of the emotional needs and value 
systems that underlie vocational choices (20, 49, 50). Case histories provide 
many examples of such childhood influences. The individual’s reaction to 
authority on the job may reflect his early relation with one or both parents. 
A worker may seek security on the job as a result of uncertainty and insecurity 
in his early family environment. A more specific hypothesis in terms of parent-child
relations has been proposed by Roe (50). Protecting or demanding parental
attitudes, according to this hypothesis, predispose the individual to seek
occupations dealing with persons; rejecting, neglecting, and casual parental
attitudes predispose him to seek occupations dealing with things, or "non-persons."
Despite the readiness with which case-history material can be found
to illustrate these childhood influences, systematic tests of this hypothesis have 
failed to support it (23, 25, 71). The general procedure has been to compare
recalled parent-child relations of persons engaged in different occupations. No 
significant relations were found between childhood familial relations and such 
occupational choices.

The limited number of occupations investigated in some of these studies 
may have been poorly chosen to represent contrasting need patterns. A more
serious methodological weakness is the reliance on subjects’ recollections of 
childhood conditions. A longitudinal approach would provide more dependable
data. It is quite likely, however, that the etiology of vocational choices is more
complex than this hypothesis presupposes. While childhood familial relationships
may be of primary significance for some persons, other factors may affect
the development of emotional needs in other instances. Experiences in school,
contacts with adults and age mates outside the family, and innumerable other
circumstances of daily life may play a part. Moreover, the characterization of 
familial environment in terms of a single variable such as warmth—or in 
terms of a few categories such as protecting, demanding, and rejecting—may 
be an oversimplification. The interaction among members of a family is highly 
intricate and may affect individual members differently. 


THE CLASSIFICATION OF OCCUPATIONS 


Dimensional Classifications. Psychological research on occupations has 
quite naturally led to attempts to classify occupations in terms of psychologically
meaningful dimensions. The original DOT classification, while representing
a significant achievement in its comprehensiveness and general applicability,
had many drawbacks. One of its principal weaknesses stems from the
mixture of different dimensions, or principles of classification. This confusion 
can be readily seen by examining its major categories, which correspond to
the first digit of the occupational code, as shown in Table 22. The professional
and managerial category is a combination of level and field of work; clerical
and sales, as well as service occupations, refer predominantly to fields; agriculture
and related occupations are classified together because of the setting
in which the work is performed; and the last three categories are obviously
based on level of skill required. The finer subdivisions within each major category
show the same mixture of classification principles. For example, the service
occupations are further broken down into domestic, personal, protective,
and building services. These groups combine work setting with nature of work
done.


TABLE 22 Major Occupational Categories
in Original Dictionary of Occupational Titles

(Data from 70, p. ix)

Category                                                   Code Number

Professional and managerial                                0
Clerical and sales                                         1
Service                                                    2
Agricultural, fishery, forestry, and kindred occupations   3
Skilled                                                    4 and 5
Semiskilled                                                6 and 7
Unskilled                                                  8 and 9


Several substitute classifications have been proposed for use in counseling
and research, in which at least two of the dimensions of the DOT classification
are separated (see, e.g., 26, 49, 59). The most obvious distinction is that be- 
tween level and field. Level is identified with such occupational variables as 
income, social status and prestige, intelligence, general educational require- 
ments, degree of authority, freedom and independence of action, and amount 
of responsibility for decision making (49, ch. 11; 59, ch. 3). Because these 
social and psychological variables are highly intercorrelated, it is feasible to 
classify occupations unidimensionally with regard to this composite definition 
of level. Although individual jobs might be differently ranked in terms of 
one or another of these variables, there is little difficulty in placing them in 
broad categories on this basis. With regard to worker requirements, occupa- 
tional levels are differentiated principally in terms of educational level and 
performance on scholastic aptitude or general intelligence tests. 

Field of work refers to the type of activity performed. Although operating 
at the same level, for example, a bricklayer and an electrician are in different 
fields, as are a physiologist and a professor of English literature. Psychologically,
field corresponds most closely to differences in aptitude patterns and in interests.
Certain other personality differences may also characterize special fields,
but the evidence for such differences is still meager, as indicated in the pre- 
ceding section. 

Super (59, ch. 3) has added a third dimension to this occupational classi- 
fication, which he calls enterprise. This dimension refers to the setting or 
industry in which the work is performed, such as agriculture, transportation, 
or government. Although not so basic psychologically as the classifications into 
level and field, choice of enterprise may be quite important in individual 
cases. A person who finds it oppressive to work indoors could profitably explore 
opportunities in agriculture and kindred occupations; one who has a deep- 
rooted aversion to bureaucratic procedures might be well advised to eschew 
employment in a government agency. Jobs in the same field and at the same 
level may appeal to very different kinds of persons when performed in different 
settings. Compare, for example, the jobs of three chemists, one employed in 
a university, another in a government laboratory, and the third in a manufacturing
plant. The previously cited studies of mental health professionals working
in institutional and noninstitutional settings may also be recalled in this
connection.

The three-dimensional schema of classification proposed by Super is illus- 
trated in Figure 111. In this classification, Super combined the Standard In- 
dustrial Classification prepared by the United States Bureau of the Budget 
with a modified version of the level-and-field classification suggested by Roe
(49, chs. 11-12). Within each field, typical occupations have been listed at
different levels. Thus, for the Outdoor-physical field, examples range from athletic
coach (level 2) to deckhand (level 6); for the Social-personal field, they
range from social scientist (level 1) to attendant (level 6). Enterprise, corresponding
to the third dimension, cannot be plotted directly on paper but
has been indicated in perspective. The broken line illustrates the three-dimensional
classification of one job, that of "civil engineer employed in conservation
work with the National Park Service." This job is classified in level 2, field V
(Mathematics-physical sciences), and enterprise A (Agriculture, forestry, fisheries).
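
The three dimensions of this schema can also be written down as a simple record. The following Python sketch is offered only as an illustration: the class name `OccupationClass`, the helper `same_level_and_field`, and the second job are hypothetical and are not part of Super's schema. Only the level, field, and enterprise of the civil engineer are taken from the example just described.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class OccupationClass:
    """One job located on the three dimensions: level, field, enterprise."""
    level: int       # 1 (highest) through 6 (lowest), as in the examples in the text
    field: str       # type of activity performed
    enterprise: str  # setting or industry in which the work is performed


def same_level_and_field(a: OccupationClass, b: OccupationClass) -> bool:
    """True when two jobs differ, if at all, only in enterprise."""
    return a.level == b.level and a.field == b.field


# The job traced by the broken line in Figure 111.
park_engineer = OccupationClass(
    level=2,
    field="V Mathematics-physical sciences",
    enterprise="A Agriculture, forestry, fisheries",
)

# Hypothetical comparison job: same level and field, different enterprise.
agency_engineer = OccupationClass(
    level=2,
    field="V Mathematics-physical sciences",
    enterprise="Government",
)

print(same_level_and_field(park_engineer, agency_engineer))   # same level and field
print(park_engineer.enterprise == agency_engineer.enterprise)  # but not same setting
```

Keeping enterprise as a separate attribute reflects the point made above that jobs in the same field and at the same level may still appeal to different kinds of persons when performed in different settings.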

Classifications According to Trait Patterns. In the attempt to integrate
job classifications more closely with worker requirements, psychologists have 
for some time been exploring the trait patterns demanded by different jobs. 
Among the early efforts in this direction is a set of ratings of 123 occupations 
according to their minimum requirements in 24 traits, prepared for use in the 
vocational-guidance program of the British National Institute of Industrial 
Psychology (42). 

The Minnesota Occupational Rating Scales (MORS) represent a more 
extensive and systematic application of the same type of rating procedure 
(44). In these scales, each of 432 occupations has been rated for the amount 
of each of seven abilities which it requires. The traits include academic ability, 
mechanical ability, social intelligence, clerical ability, musical talent, artistic 
ability, and physical agility. For every occupation, each of these seven traits 
is rated on a four-point scale (A to D). These ratings were based on the pooled
judgments of vocational psychologists. 

The 432 occupations are first arranged alphabetically, each with its cor- 
responding ability pattern. To facilitate the use of the scales in counseling, 
the same occupations are reclassified in several other ways on the basis of their
ability patterns. For example, one classification groups together all occupations 
requiring the A level and those requiring the B level of each ability in turn. 
It should be noted that, in the MORS, level is rated for each of the seven 
traits, rather than for the job as a whole as was done in the previously de- 
scribed dimensional classifications. With regard to mechanical ability, for ex- 
ample, an electrical engineer and a toolmaker fall into the A level, while a 
street sweeper and a lawyer fall equally into the D level (44, p. 20). 
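
The regrouping of occupations by ability level can be sketched in a few lines of Python. This is an illustration under stated assumptions, not the published scales: the function name `group_by_level` and the data layout are hypothetical, and only the four mechanical-ability ratings quoted above are entered. The remaining six ratings for each occupation are left out rather than guessed.

```python
from collections import defaultdict

# Partial ability patterns: only the mechanical-ability ratings cited in the
# text are filled in. The other six MORS ratings are deliberately omitted.
ratings = {
    "electrical engineer": {"mechanical": "A"},
    "toolmaker": {"mechanical": "A"},
    "street sweeper": {"mechanical": "D"},
    "lawyer": {"mechanical": "D"},
}


def group_by_level(ratings, ability):
    """Group occupations by their rated level (A to D) on a single ability."""
    groups = defaultdict(list)
    for occupation, pattern in ratings.items():
        level = pattern.get(ability)
        if level is not None:  # skip occupations with no rating entered
            groups[level].append(occupation)
    return {level: sorted(names) for level, names in groups.items()}


print(group_by_level(ratings, "mechanical"))
```

Because level is attached to each ability rather than to the job as a whole, the same occupation could fall in different groups depending on which ability is chosen.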

Mention should also be made of the empirically established Occupational 
Ability Patterns based on General Aptitude Test Battery scores (24). Each of 
these Occupational Ability Patterns shows the minimum cutoff score in critical 
aptitudes for a given job. These score patterns were found by administering 
the complete battery to samples of workers in many different occupations. 
Critical aptitudes were those in which workers in a given job excelled the 
general norms, as well as those yielding significant correlations with criteria of 
job success. Occupational Ability Patterns have so far been developed for 
over five hundred occupations.
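
The use of minimum cutoff scores can be illustrated with a short Python sketch. Everything specific in it is an assumption made for the example: the job names, the cutoff values, and the individual's scores are invented, and the functions `meets_pattern` and `qualifying_jobs` are hypothetical helpers. The letters stand for aptitudes of the battery, but no actual Occupational Ability Pattern is reproduced.

```python
def meets_pattern(scores, cutoffs):
    """True if every critical aptitude score reaches its minimum cutoff.

    An aptitude missing from `scores` counts as failing the cutoff.
    """
    return all(
        scores.get(aptitude, float("-inf")) >= minimum
        for aptitude, minimum in cutoffs.items()
    )


def qualifying_jobs(scores, patterns):
    """Return, in alphabetical order, the jobs whose cutoffs are all met."""
    return sorted(job for job, cutoffs in patterns.items()
                  if meets_pattern(scores, cutoffs))


# Hypothetical patterns: each job lists only its critical aptitudes.
patterns = {
    "job_x": {"G": 110, "N": 105, "S": 100},
    "job_y": {"Q": 100, "K": 90},
}

# Hypothetical scores for one individual.
scores = {"G": 115, "N": 108, "S": 95, "Q": 102, "K": 97}

print(qualifying_jobs(scores, patterns))
```

Each pattern names only the aptitudes critical for that job, so a low score on an aptitude that is not critical does not count against the individual.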

Both the MORS ratings and the GATB score patterns are limited to apti- 
tudes. More recently, counseling psychologists have been exploring the possi- 
bility of incorporating interests, needs, and other personality factors into the 
descriptions and classifications of occupations (e.g., 7, 30, 31). The functional
classification of occupations, to be employed in the forthcoming revision of 
DOT, includes some interest and personality variables (18). This classification, 
which was briefly described in Chapter 16, represents an extension of the 
approach followed in the MORS. Ratings by job analysts were again utilized 
to establish worker requirements in general educational development, specific 
vocational preparation, 11 aptitudes, 10 interest areas, 12 “temperament qualities”
(such as need for variety and change or ability to perform under stress),
and 6 physical traits. Occupational Ability Patterns on the GATB will also be 
included for all jobs for which they are available. The application of this 
functional classification to all DOT jobs will undoubtedly be the most ambi- 
tious vocational classification project yet undertaken. 


VOCATIONAL DECISIONS


One of the most important objectives of all counseling is to facilitate the 
making of effective decisions. In contrast to the charlatan, who is likely to 
hand his client a ready-made solution for his problems or to choose a specific 
occupation for him, the counseling psychologist concentrates on improving the 
individual's own decision making. Effective vocational decisions require accurate
knowledge regarding both abilities and wants or needs. Tests help in
this connection, as does much of the verbal interaction between counselor and
counselee. In addition, effective decisions require knowledge about occupations,
which counseling likewise tries to provide. Finally, they require the
ability to think clearly—to combine available information in predicting out- 
comes, to weigh alternative courses of action, and to make choices that ade- 
quately reflect all relevant factors. 

Vocational planning normally involves, not one decision, but a multitude of 
decisions, which typically extend over a period of several years. Many prelimi- 
nary decisions may have to be made and implemented before the individual 
finally enters his chosen field of work. Moreover, people's wants and their 
suitability for different kinds of careers may change with time. Jobs also 
change. In our rapidly evolving society, new fields of work constantly emerge
and even familiar jobs may become drastically altered. All these circumstances
call for flexibility in the vocational decisions made at any one stage. Also relevant
is the multipotentiality of both persons and jobs, to which we have repeatedly
alluded. Any one person is qualified for many different jobs, not just
for a single job. Conversely, a given occupation may be successfully pursued 
in many different ways by different persons. 

The goal of vocational counseling is not early commitment to an ultimate 
vocational choice. Rather, counseling contributes to vocational decisions in at 
least three ways. First, it helps the individual in making immediate decisions 
as demanded by his present environment. For instance, a junior high school 
student may have to choose among different types of schools, curricula, or 
courses; or he may have to decide whether to continue his education or take a 
job. Such decisions require that the individual consider the general direction 
in which he wants to go and the major future implications of his decision.
But there will be many opportunities for modifying the direction later and for 
making more specific choices in the light of subsequent experience. 

In the second place, counseling may provide knowledge about oneself and 
about the world of work that will be used later—perhaps several years later 
—when the occasion demands it. At the time of counseling, the individual 
may have reached no decision at all. But as a result of the information he 
acquired, he may accept or reject a specific job when it is offered to him. If 
it contributed to the wisdom of that decision, counseling was effective. Finally, 
the counseling process offers an opportunity to learn more effective procedures 
for decision making itself. This experience is designed to help the individual 
make wiser and more satisfying decisions in solving future problems of daily
life.

Occupational choice represents a synthesis between wishes and reality, be- 
tween what one wants and what one learns he can attain (59, ch. 21). 
Through continuing interaction with his environment, the individual tests his 
personal needs, aptitudes, and other attributes against the demands and resources
of society. In the process of growing up, he has an opportunity to
explore different roles through fantasy, play activities, schoolwork, extracurric- 
ular functions, household tasks, and part-time or temporary jobs. Specific in- 
terests probably develop in much the same way. As Strong put it, “An interest 
is an expression of one's reaction to his environment. The reaction of liking-disliking
is a resultant of satisfactory or unsatisfactory dealing with an object"
(57, p. 682). This resultant obviously depends upon the characteristics of the 
individual as well as upon the characteristics of the particular objects he en- 
counters and the circumstances under which he does so. 

The extent to which one’s vocational choices are reality-oriented may be 
related to his general personality adjustment. A study designed to test this 
hypothesis was conducted with 100 boys between the ages of fifteen and nine- 
teen who had been referred to a community counseling center for vocational 
counseling (54). The group included 50 “definitely maladjusted” referrals 
from psychiatric hospitals and clinics and 50 “better adjusted” referrals from 
schools and similar sources. In the course of counseling, each boy was asked 
to state two vocational choices in order of preference. On the basis of the 
counselor's information about the boy and the jobs, each choice was given a 
"reality-deviation score.” 

Analysis of these scores revealed that both first and second choices were 
significantly more realistic among the well-adjusted than among the mal- 
adjusted boys. This difference was larger for the first than for the second 
choices. Moreover, among the well-adjusted the first choices tended to be more 
realistic than the second, while the reverse was true among the maladjusted. 
Although such findings do not permit an analysis of cause-effect relations, the 
more realistic reaction of the well-adjusted subjects is of interest in its own 
right. Insofar as counseling can lead the individual to make more realistic
choices, it may contribute to his over-all adjustment. On the other hand, the 
counselor must guard against any tendency to brand as unrealistic vocational 
aspirations that fail to conform to his own set of values or vocational plans that 
reveal the courage and breadth of vision to pursue a novel or difficult path. 

Tyler (68; 69, ch. 14) has focused attention upon the important role of de- 
cisions in the individual's general development. Within any lifetime, only a 
small fraction of one's “potentialities” can be realized. A person may have the 
qualifications to become a distinguished surgeon, writer, and statesman, but 
there is not enough time to be all three. Choices must be made throughout 
life and each in turn limits and determines the direction of subsequent development.

At the outset, these limits are set largely by external circumstances. For 
instance, a newborn infant could learn any existing language—or even a non- 
existing language that someone might devise. But what he actually learns to 
speak is quickly narrowed down to the language of his parents. As the child 
becomes older, however, the limits of his development are established more 
and more by his own choices—whether he spends more time on schoolwork 
or play, whether he plays baseball or collects stamps, the companions he 
chooses, the books he reads, and so forth. Although available opportunities, 
decisions by other persons, and his own physical and psychological attributes 
continue to set many limits upon his behavior, the individual himself participates
increasingly in the process. At every choice point he encounters, he
helps to set his own limits and to shape his future experiences. 

In this concept of choice, Tyler sees a rationale for the successful predictions
of occupational choice made with such instruments as the Strong
Vocational Interest Blank. A high score on this inventory signifies that the
individual's pattern of choices resembles that of persons in a given occupation.
This in turn should help us to predict how he will choose in similar circumstances
in the future.

Much research on the nature of choices remains to be done. Tyler (67)
describes an ongoing longitudinal study of children's expressed likes and dislikes
from the first grade through high school. The interest or choice patterns
of these subjects during their high school years are being analyzed with reference
to many concurrent and earlier variables. The influence of positive and
negative factors in the choice process is another question investigated in the
same project. To what extent are choices based on the rejection of what one
dislikes and to what extent on the acceptance of what one likes? Some data on 
this question have also been obtained by comparing like and dislike responses 
on the Strong VIB (3, 67). 

Vocational choice can be analyzed in the light of decision theory. There 
is a growing number of psychological experiments conducted within the 
framework of decision theory, which should provide some of the basic re- 
search for counseling. Decision theory began as an attempt to put the decision- 
making process into mathematical form so that available information may be 
used to arrive at the most effective decision under given circumstances. The 
mathematical procedures employed in decision theory are often quite complex, 
and few are in a form permitting their immediate application to psychological 
problems. Nevertheless, the basic concepts of decision theory help in formu- 
lating and clarifying the decision process. A simple and readable introduction 
to these concepts can be found in a book by Bross (9). A more detailed treatment
oriented toward applications to psychological research is given by Edwards (16).

Essentially, decisions are based upon: (1) predicted probability of different
outcomes; (2) estimates of risks involved or seriousness of failure; and (3) consideration
of the relative value or importance of different goals. In vocational
decisions, information about the individual's capabilities, obtained from tests 
or other sources, together with job information, provides the data for predict- 
ing outcomes. Thus the individual, with the help of the counselor, might 
estimate his chances of completing an apprenticeship as machinist or of grad- 
uating from medical school. The consequences of making a bad decision help 
to establish the certainty level we require to take action. If the consequences 
of failure are slight, we may be willing to try a course of action even though 
there is a high probability of failure. 
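
The way these three considerations combine can be sketched with a simple expected-value calculation. The Python example below is a minimal illustration, not a procedure taken from the decision-theory literature cited here. It borrows the trackman example from the workbook excerpt reproduced in Figure 112, with an 8-in-10 chance of clearing the chasm. The numerical values placed on each outcome, and the function names `expected_utility` and `best_option`, are assumptions made for the example.

```python
def expected_utility(outcomes):
    """Sum of probability times value over all outcomes of one course of action."""
    total_probability = sum(p for p, _ in outcomes)
    if abs(total_probability - 1.0) > 1e-9:
        raise ValueError("outcome probabilities must sum to 1")
    return sum(p * value for p, value in outcomes)


def best_option(options):
    """Name of the course of action with the highest expected utility."""
    return max(options, key=lambda name: expected_utility(options[name]))


# Each option is a list of (probability, assumed value) pairs.
# Case 1: jumping only to pick a few blueberries.
blueberries = {
    "jump": [(0.8, 1.0), (0.2, -1000.0)],  # small gain, very serious failure
    "stay": [(1.0, 0.0)],                  # nothing gained, nothing lost
}

# Case 2: jumping to escape a bear. The chance of success is unchanged,
# but staying is now assumed to be as serious as falling.
bear = {
    "jump": [(0.8, 0.0), (0.2, -1000.0)],
    "stay": [(1.0, -1000.0)],
}

print(best_option(blueberries))
print(best_option(bear))
```

The probability of success is the same in both cases; the preferred action changes only because the values attached to the outcomes change. Assigning such values is the difficult step, and the numbers used here are arbitrary.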

Both decision theory and counseling psychology recognize the fundamental
importance of values in the decision process. The fact that to be workable the 
individual's decisions must be consonant with his own value system provides 
one more reason why counselors cannot make decisions for clients. In decision 
theory, the mathematical expression of the relative values of different out- 
comes constitutes a particularly thorny problem. Values, however, inevitably 
enter into decision making; decision theory merely requires that they be made 
explicit. 

Most available psychological research on the decision process is exploratory 
and deals with questions that are far removed from the counseling situation.
A few studies concerned more specifically with vocational decisions provide
suggestive findings. There is some evidence, for example, that college students
who are undecided about their career plans and are unable to specify a vocational
choice tend to be more reluctant to take risks and place a higher value on
security than do those reporting definite career plans (see 69, p. 209). It is 
likely that an unrealistic demand for certainty makes for indecisiveness in 
vocational planning. A potential source of raw data for applied research on 
decision making is to be found in counseling interviews that deal with decision 
problems. So far virtually no use has been made of such data. 

An excellent example of the practical application of vocational counseling
principles is provided by a workbook entitled You: Today and Tomorrow (35). 
Designed for a group guidance unit at the eighth- and ninth-grade level, this 
workbook introduces the student to vocational problems and provides exercises 
in self-appraisal, in the utilization of educational and vocational information, 
and in decision making. Data regarding abilities can be obtained from intelli- 
gence, multiple aptitude, or achievement tests administered in the regular 
school-testing program, as well as from the student's grade record. In addition, 
there is provision for the student to consider his schoolwork, extracurricular 
activities, hobbies, and other experiences in the appraisal of his abilities, in- 
terests, and values. The handbook presents problems and information in an 
interesting and highly readable style. An excerpt is reproduced in Figure 112. 

With its accompanying teacher’s guide, You: Today and Tomorrow is suitable
for use by a classroom teacher in a program extending over about thirty
class sessions. Its focus is on the student's immediate task of choosing a high 
school curriculum, both for its direct effect on later vocational choices and for 
its value as a paradigm in decision making. Preliminary evaluation studies 
indicated that the program was well received by students, teachers, and school 
counselors. Pretests and posttests demonstrated significant learning of key con- 
cepts. In one study, pupils were intensively interviewed before and after their 
participation in the program (22). Indices of maturity in educational and
vocational planning, derived from the interview records, showed highly significant
improvement. Because of the lack of a control group, however, it is impossible
to determine how much of this gain may have resulted from the
repetition of the interview or from other intervening experiences. 


VOCATIONAL DEVELOPMENT 

Rationale. The recognition that vocational decisions are made over a period 
of many years has led to the concept of vocational development, which has 
found its fullest expression in the work of Super and his associates (59, 61). 
Drawing upon both developmental psychology (10) and sociological analyses 
of occupational behavior (21, 40), Super discusses the process of vocational 
adjustment in terms of five stages. The first is the growth stage, extending 
from birth to about the age of fourteen. During this period, the child begins 
to develop a self concept. Through observation of parents and other adults in 
the family circle, he first becomes acquainted with different roles that an 
individual may play in our culture. As he grows older, his available role models 
are augmented through his contacts in school and in other situations outside 
the home. Still other roles may be encountered vicariously in books, movies, or 
television.

So we can see that the information we get
from a forecast or prediction is only one fact 
that goes into making a wise decision. A lot 
does depend on the chances in our favor or 
against us. But a lot also depends on the 
value or importance of our goals; and a lot 
depends on the seriousness or danger of our 
risks. 

A trackman who has jumped 21 feet in 8 out 
of 10 tries would be foolish to try to jump 
across a 20-foot chasm just to pick a few blue- 
berries on the other side. But what if he were 
being chased by an angry grizzly bear? 


So in deciding which high school course to 
take, or in setting goals for your future educa- 
tion and occupation, you will want to consider 
not only your chances of success or failure. 
You will want to consider also how serious it 
would be to fail and how important it 
would be to succeed. 


Fig. 112 Excerpt from You: Today and Tomorrow—a group guidance 
program for junior high school students. (From Katz, 35, p. 34.) 


From an early age, the child begins to “try on” different roles in play or in 
fantasy. He may become in turn a cowboy, policeman, fireman, or astronaut. 
At this level, needs are uppermost in his selection of roles. It is only later that 
he begins to consider interests, then abilities, and still later job requirements 
and opportunities. The apparently irrational occupational roles assumed in 
childhood make good sense when examined within their own context. Typi- 
cally, the young child is dominated by adults who restrict his freedom of action 
and protect him from danger. Against such a background, his needs for 
independence, for power over others, and for adventure find outlets in play. 
If asked what he wants to be when he grows up, the child is also likely to 
name an occupation within his limited repertory of knowledge that seems to 
fulfill these needs. 

In recognizing that an occupational role can help to meet his needs, the child 
has taken the first step in his vocational development. With time, he finds 
increasing opportunities for realistic tryout of vocational roles. As he (or she) 
carries out household chores, works in the neighbor's yard, baby-sits, tinkers 
with mechanical gadgets, plays baseball, takes violin lessons, or studies 
different subjects in school, the process of reality testing begins. 

The second stage proposed by Super is that of exploration, covering adoles- 
cence and early adulthood. This is a period of extensive reality testing, with 
consequent modification of the self concept. It is at this stage that important 
educational decisions are first faced and vocational goals are first seriously 
examined. For these reasons, vocational counseling has traditionally focused on 
this period. At this stage the individual also undertakes more formal 
work-role tryouts through part-time, temporary, and trial jobs. 

The third stage is that of establishment. After some preliminary trial-and- 
error and floundering, most persons show a tendency to “settle down” in a 
particular line of work. Stabilization and advancement are typical of this stage. 
The individual comes to identify with his chosen field of work; he accumulates 
experience that ties him more closely to that field; and he assimilates the gen- 
eral way of life associated with his job. The fourth stage is one of maintenance. 
Continuing in the same field of work, the individual at middle age typically 
concentrates on retaining the position he has attained on the job, at home, 
and in the community. He either enjoys the fruits of his labor or accepts his 
frustrations. This stage leads finally to occupational decline, as indicated by 
reduction in vocational activity and eventual retirement. 

Like any system of stages employed to describe human development, this 
schema is admittedly an oversimplification. While helping to focus attention 
upon developmental sequences and upon tendencies characterizing different 
life periods, it is limited in a number of ways. First, the developmental process 
is continuous, and any description of it in terms of discrete stages is only an 
approximation. Second, there are wide individual differences in the time of 
onset and duration of each stage. In some persons, one or more stages may be 
skipped altogether. Individual career patterns, for example, vary in the number 
and nature of job shifts (40; 59, ch. 5). These differences in mobility are also 
associated with the nature of the work. Thus for semiskilled or clerical work, 
changing jobs usually entails no loss and may represent a gain. For work 
requiring a long period of professional preparation or for jobs in which con- 
siderable premium is put on experience, on the other hand, earlier decisions 
are more nearly irrevocable, and the typical career pattern is very stable. 
Similar differences may be found at later stages. Some persons continue to 
advance in their work throughout life. For them, there never comes a time 
for mere maintenance of the status quo or for decline in productivity. 

When these limitations are borne in mind, the above stages can provide a 
convenient framework for organizing a mass of discrete facts. They can serve 
to establish norms against which individual development can be evaluated. 
And they help to generate testable hypotheses for research. 

A related concept is that of developmental tasks (17, 27, 61). In vocational 
development as in other forms of human development, the individual 
encounters typical behavioral demands and problems at different life stages from 
the preschool period to the age of retirement. Although both the problems and 
the appropriate reactions vary somewhat among cultures and subcultures, 
modal requirements can be specified within a given cultural setting. Each of 
the vocational life stages described above makes characteristic demands upon 
the individual. Mastery of the developmental tasks of earlier stages influences 
the individual's handling of the behavioral demands of the next. 

Super (61) has used this concept of developmental tasks in his treatment of 
vocational maturity. The individual's vocational maturity is measured in terms 
of his mastery of the vocational tasks of his age level or his effectiveness in 
coping with the characteristic problems of his life stage. For example, among 
the vocational developmental tasks of an elementary schoolchild are the choice 
of daily activities suited to one's capacities, the assumption of responsibility 
for one's acts, and the undertaking of cooperative enterprises. Vocational de- 
velopmental tasks of a man in his sixties might include, among others, planning 
for any necessary curtailment of work load or change of functions, as well as 
realistic consideration of retirement prospects. Vocational adjustment at any 
stage is the resultant of interaction between the individual's vocational maturity 
and the reality demands of the situations he encounters. Thus, while vocational 
maturity is defined in terms of the individual's behavior, his vocational adjust- 
ment is defined in terms of the outcomes of his behavior (59, p. 187). At any 
stage of vocational development, moreover, vocational maturity implies sufficient 
flexibility to permit the individual to modify his occupational activities and 
life style to meet the demands of changing personal and societal situations. 

Career Development Studies. The concepts of developmental tasks and 
vocational maturity provided the theoretical framework for a 20-year longi- 
tudinal investigation of approximately one hundred ninth-grade boys, initiated 
by Super and his associates (61, 63). The subjects were all attending a public 
high school in an urban community in New York State, chosen as typical of 
American cities in many socioeconomic indices. A mass of data were gathered 
through tests, questionnaires, and a series of four semistructured interviews 
with each subject. Further data were obtained from records of school achieve- 
ment, reports of extracurricular and community activities, peer ratings, inter- 
views with parents, and other miscellaneous sources. Several indices of voca- 
tional development were formulated and applied to the group. The resulting 
measures were intercorrelated and factor analyzed. On the basis of these 
analyses, the following indices were identified as the best measures of the 
construct "vocational maturity" at this age level: concern with choice (aware- 
ness of need for choice and knowledge of factors affecting choice), acceptance 
of responsibility for choice and planning, specificity of planning for the 
preferred occupation, and use of resources in obtaining occupational 
information. 

The findings suggested that the major vocational developmental task at the 
junior high school level is that of preparing to make vocational choices. For 
the ninth-grade boy, vocational maturity is shown, not by the wisdom or con- 
sistency of the ultimate vocational goal he chooses, but rather by the way he 
handles the preliminary planning and exploration required at this stage. Cor- 
relations with many personal and background variables were also found in the 
effort to investigate factors associated with vocational maturity. As defined in 
this study, vocational maturity tended to be positively correlated with intelli- 
gence, parental occupational level, extent of cultural stimulation provided in 
the home, level of boy's own occupational choice, and enrollment in a college 
preparatory curriculum. It is thus possible that the indices chosen in this study 
may be more appropriate for assessing the vocational maturity of boys planning 
to enter higher-level occupations than for those planning to enter lower-level 
occupations. 

Despite the impressive accumulation of data, the results of this initial, cross- 
sectional analysis of data must be regarded as tentative and exploratory. The 
anticipated follow-ups of the same group of boys will undoubtedly permit a 
more definitive evaluation of initial findings. By its very nature, vocational 
maturity cannot be adequately defined without longitudinal data. Specifically 
we must know how each of the proposed indices of vocational development 
changes with age and how it correlates with measures of subsequent voca- 
tional adjustment in the individual. Otherwise we are unable to evaluate 
deviations above or below the group norm in any of the variables. 

Equally significant data should be provided by several other longitudinal 
studies of career development that are now in progress both in this country 
and elsewhere (see 6, 65). Cross-sectional studies of more limited scope are also 
helping to round out the picture. In one study of high school boys, for example, 
self-estimates of aptitudes and interests were correlated with test scores ob- 
tained on the same variables (43). These correlations were higher for twelfth- 
graders than for ninth-graders, suggesting an improvement in self-knowledge 
and an increasingly realistic orientation as the boys progressed through high 
school. It might be added parenthetically that the previously cited group 
guidance unit for the junior high school, entitled You: Today and Tomorrow 
(35), represents a direct application of the vocational development rationale 
and of the specific developmental tasks proposed by Super and his associates. 
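
The comparison of self-estimates with test scores described above rests on an ordinary product-moment correlation computed separately for each grade. The following is only a minimal sketch of that computation, not the procedure of the cited study: the function name `pearson_r`, the 1-to-5 self-rating scale, and every number in the data lists are hypothetical and chosen purely for illustration. For simplicity the same set of test scores is reused for both grades.

```python
from math import sqrt


def pearson_r(xs, ys):
    """Pearson product-moment correlation between two equal-length lists."""
    if len(xs) != len(ys) or len(xs) < 2:
        raise ValueError("need two lists of equal length with at least 2 values")
    mean_x = sum(xs) / len(xs)
    mean_y = sum(ys) / len(ys)
    dx = [x - mean_x for x in xs]          # deviations from the mean of x
    dy = [y - mean_y for y in ys]          # deviations from the mean of y
    cross = sum(a * b for a, b in zip(dx, dy))
    ss_x = sum(a * a for a in dx)
    ss_y = sum(b * b for b in dy)
    if ss_x == 0 or ss_y == 0:
        raise ValueError("correlation is undefined when one variable is constant")
    return cross / sqrt(ss_x * ss_y)


# Hypothetical data (assumption): aptitude test scores for five boys,
# paired with their own 1-to-5 estimates of the same aptitude.
test_scores = [42, 48, 50, 57, 60]
ninth_grade_self = [3, 1, 4, 2, 5]      # estimates only loosely track scores
twelfth_grade_self = [1, 2, 3, 4, 5]    # estimates track scores closely

r_ninth = pearson_r(ninth_grade_self, test_scores)
r_twelfth = pearson_r(twelfth_grade_self, test_scores)
```

With these invented figures the twelfth-grade correlation is the higher of the two, which is the pattern the study interpreted as a gain in self-knowledge. As the chapter notes for cross-sectional findings in general, such a difference between grades does not by itself show change within the same individuals.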

Life-span Counseling. The increasing emphasis placed upon vocational 
development, as contrasted to single vocational choices, has been paralleled by 
a conception of vocational counseling as a continuing process. Life-span 
counseling is now widely advocated. Such counseling is needed to help the 
child keep up with the developmental tasks of his age level in preparation for 
later vocational choices and to guide the adolescent and young adult through 
his vocational explorations and successive approximations. Following entry 
into the world of work, other decisions must be made about specific jobs 
within one's chosen field and about job changes to improve job satisfaction, to 
facilitate vocational advancement, or for any number of individual reasons. 

In special cases, a drastic reassessment of occupational choice may be re- 
quired at any age level. This is true of an individual who discovers he made 
an unsuitable choice at an earlier stage, or one whose needs and values have 
altered radically with time, or one who has suffered a physical disability through 
injury or disease. The tubercular, the heart patient, and the disabled war 
veteran are common examples of the need for vocational counseling during 
early or late maturity. Still other examples are provided by a married woman 
who is suddenly widowed or by an older woman whose children have grown 
up and who would like to enter or reenter the world of work. 

Vocational counseling of older workers is now receiving more and more 
attention. The special vocational problems encountered at this stage may 
revolve around changes in the nature or amount of work done on the present 
job, transfer to a more suitable type of job, or complete retirement from work. 
Research on problems of retirement has so far concentrated on exploratory 
surveys of attitudes, plans, problems, and effects on both the individual and 
his family. The tools have been largely questionnaires and interviews (e.g., 11, 
13, 28, 66). A project conducted at Cornell introduced an important methodo- 
logical improvement by gathering longitudinal data at four points in time: 
before retirement and after intervals of two, four, and five years (56). Com- 
parisons were also made between retired workers and those still gainfully 
employed. The study included 4,032 workers in 259 industrial, governmental, 
and other organizations distributed over the entire country. Mention should 
also be made of the growing body of research on the vocational skills of older 
workers (e.g., 74). 

Finally, it is being recognized increasingly that vocational counseling should 
not be limited to cross-sectional "crisis counseling," i.e., counseling given when 
the individual faces a serious problem. With its positive and preventive 
orientation, counseling should help to forestall crises. Moreover, a counselor can 
function most effectively if he maintains continuing, long-term contact with 
the same client. Super (59, ch. 23) proposes that such a relationship might be 
comparable to the typical relationship one has with a family physician, dentist, 
minister, or lawyer. The individual seeks help when he feels the need for it 
and possibly for an occasional checkup. But on these occasions he continues to 
consult the same counselor, who thus accumulates enough background in- 
formation to be of maximum service to the client. The continuing contacts 
that a school counselor maintains with his counselees, as supplemented by 
cumulative records, approximate such a relationship. But in the counseling of 
adults, this pattern is so far largely untried, except possibly in the case of 
employee counseling in certain industrial corporations. 


SUMMARY 


A major area of research in counseling psychology concerns the process of 
occupational choice and the nature of vocational development. In selecting an 
occupation, the individual chooses a way of life. One's occupation is likely 
to affect not only his income level and general social status but also many 
specific aspects of daily life. Job satisfaction and general emotional well-being 
are contingent upon the extent to which a person’s work satisfies needs and 
values that are significant to him. There is a considerable accumulation of 
data on personality patterns characteristic of persons in different occupations, 
but the findings are often difficult to interpret because of procedural inade- 
quacies. Some research has also been done on the relation between job satisfac- 
tion and need satisfaction and on the role of childhood experiences in 
vocational choice. 

Both occupational research and vocational counseling call for occupational 
classifications based upon psychologically meaningful categories. Dimensional 
classifications have utilized the concepts of level, field, and enterprise. Trait 
patterns have been employed in other classifications, such as those followed 
in the Minnesota Occupational Rating Scales and in the forthcoming revision 
of the Dictionary of Occupational Titles. 

Vocational choice involves, not one, but a series of decisions made over a 
period of years. It is one of the objectives of counseling to facilitate decision 
making by providing self-knowledge, occupational information, and training in 
the decision process. Occupational choice represents a synthesis between indi- 
vidual wants or needs and the demands and limitations of reality. An effective 
decision takes into account the probabilities of different outcomes, the risks 
involved, and the individual's value system. Throughout life, decisions made 
by the individual set limits upon his subsequent experiences and help to 
determine the direction of his development. Psychological research on the 
decision process, which has barely begun, should eventually make significant 
contributions to counseling psychology. 
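
The weighing of probabilities, risks, and values summarized above can be illustrated with a simple expected-value comparison, using the trackman of Fig. 112. This is a sketch under stated assumptions rather than a method taken from the counseling literature cited in this chapter: the function names and all of the numerical "values" attached to outcomes are hypothetical, and only the 8-in-10 chance of clearing the chasm comes from the excerpt.

```python
def expected_value(outcomes):
    """Sum of probability * value over the possible outcomes of one option.

    `outcomes` is a list of (probability, value) pairs whose
    probabilities must add up to 1.
    """
    total_p = sum(p for p, _ in outcomes)
    if abs(total_p - 1.0) > 1e-9:
        raise ValueError("probabilities must sum to 1")
    return sum(p * v for p, v in outcomes)


def best_option(options):
    """Return the name of the option with the highest expected value."""
    return max(options, key=lambda name: expected_value(options[name]))


# Values are in arbitrary units and are assumptions for illustration only.
# Case 1: nothing at stake but a few blueberries.
no_bear = {
    "jump": [(0.8, 1), (0.2, -1000)],   # a small gain vs. a fatal fall
    "stay": [(1.0, 0)],                 # nothing gained, nothing lost
}

# Case 2: an angry grizzly bear is close behind.
with_bear = {
    "jump": [(0.8, 0), (0.2, -1000)],   # escape vs. a fatal fall
    "stay": [(1.0, -1000)],             # caught by the bear
}

choice_without_bear = best_option(no_bear)
choice_with_bear = best_option(with_bear)
```

The chance of a successful jump is the same in both cases; only the values attached to the outcomes differ, and that alone reverses the preferred choice. Real decisions seldom allow values to be stated so precisely, so the arithmetic is best read as a way of keeping chances and stakes separate.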

Counseling psychologists now emphasize the continuity of vocational de- 
velopment throughout life. In describing this developmental process, Super 
identifies five stages: growth, exploration, establishment, maintenance, and 
decline. Each stage presents its characteristic developmental tasks. An indi- 
vidual’s vocational maturity is defined in terms of his mastery of the vocational 
tasks appropriate to his age. Longitudinal studies of vocational development 
have been initiated both in America and in other countries. A practical applica- 
tion of psychological knowledge regarding developmental tasks as well as 
decision making can be found in a group guidance program entitled You: 
Today and Tomorrow. There is increasing recognition of the need for life- 
span vocational counseling, since each life stage presents characteristic problems 
of its own. Counseling can be made more effective by the establishment of a 
continuing relationship between counselor and client. Vocational adjustment 
requires flexibility to reorient oneself to changing societal demands and indi- 
vidual capabilities throughout life. 


REFERENCES 


1. Anastasi, Anne. Differential psychology. (3rd ed.) New York: Macmillan, 1958. 

2. Anastasi, Anne. Psychological testing. (2nd ed.) New York: Macmillan, 1961. 

3. Berdie, R. F. Likes, dislikes, and vocational interests. J. appl. Psychol., 1943, 27, 
180-189. 

4. Bogard, H. M. Union and management trainees: A comparative study of 
personality and occupational choice. J. appl. Psychol., 1960, 44, 56-63. 

5. Bordin, E. S. A theory of vocational interests as dynamic phenomena. Educ. 
psychol. Measmt, 1943, 3, 49-65. 

6. Borow, H. Research programs in career development. J. counsel. Psychol., 1960, 
7, 62-70. 

7. Brender, M. Toward a psychodynamic system of occupational classification. J. 
counsel. Psychol., 1960, 7, 96-100. 

8. Brophy, A. L. Self, role, and satisfaction. Genet. Psychol. Monogr., 1959, 59, 
163-308. 

9. Bross, I. D. J. Design for decision. New York: Macmillan, 1953. 

10. Bühler, Charlotte. Der menschliche Lebenslauf als psychologisches Problem. 
Stuttgart: Hirzel, 1933. 

11. Burgess, E. W., et al. Occupational differences in attitudes toward aging and 
retirement. J. Geront., 1958, 19, 203-206. 

12. Cleveland, S. Personality patterns associated with the professions of dietitian and 
nurse. J. Hlth hum. Behav., 1961, 2, 113-124. 

13. Corey, L. G. Psychological adjustment and the worker role: An analysis of 
occupational differences. J. appl. Psychol., 1959, 43, 253-255. 

14. Crites, J. O. Factor analytic definitions of vocational motivation. J. appl. Psychol., 
1961, 45, 330-337. 




15. Darley, J. G., and Hagenah, Theda. Vocational interest measurement: Theory and 
practice. Minneapolis: Univer. Minn. Press, 1955. 

16. Edwards, W. The theory of decision making. Psychol. Bull., 1954, 51, 380-417. 

17. Erikson, E. H. Childhood and society. New York: Norton, 1950. 

18. Fine, S. A., and Heinz, C. A. The estimates of worker trait requirements for 4000 
jobs. Personnel Guid. J., 1957, 36, 168-174. 

19. Forer, B. R. Personality factors in occupational choice. Educ. psychol. Measmt, 
1953, 13, 361-366. 

20. Friend, Jeannette G., and Haggard, E. A. Work adjustment in relation to family 
background. Appl. Psychol. Monogr., 1948, No. 16. 

21. Ginzberg, E., et al. Occupational choice: An approach to a general theory. New 
York: Columbia Univer. Press, 1951. 

22. Gribbons, W. D. Evaluation of an eighth grade group guidance program. 
Personnel Guid. J., 1960, 38, 740-745. 

23. Grigg, A. E. Childhood experience with parental attitudes: A test of Roe's 
hypothesis. J. counsel. Psychol., 1959, 6, 153-155. 

24. Guide to the use of the General Aptitude Test Battery. Section III: Development. 
Washington: Govt. Printing Office, 1962. 

25. Hagen, D. Careers and family atmospheres: An empirical test of Roe's theory. J. 
counsel. Psychol., 1960, 7, 251-256. 

26. Hahn, M. E., and MacLean, M. S. Counseling psychology. (2nd ed.) New York: 
McGraw-Hill, 1955. 

27. Havighurst, R. J. Human development and education. New York: Longmans, 
Green, 1953. 

28. Havighurst, R. J. The social competence of middle-aged people. Genet. Psychol. 
Monogr., 1957, 56, 297-375. 

29. Heist, P. Personality characteristics of dental students. Educ. Rec., 1960, 41, 
240-252. 

30. Holland, J. L. A theory of vocational choice. J. counsel. Psychol., 1959, 6, 35-45. 

31. Holland, J. L. Some explorations of a theory of vocational choice: I. One- and 
two-year longitudinal studies. Psychol. Monogr., 1962, 76, No. 26. 

32. Holt, R. R., and Luborsky, L. Personality patterns of psychiatrists: A study of 
methods for selecting residents. Vols. 1 and 2. New York: Basic Books, 1958. 

33. Hoppock, R. Occupational information: Where to get it and how to use it in 
counseling and teaching. New York: McGraw-Hill, 1957. 

34. Jacobson, F. N., Rettig, S., and Pasamanick, B. Status, job satisfaction, and factors 
of job satisfaction of state institution and clinic psychologists. Amer. Psychologist, 
1959, 14, 144-150. 

35. Katz, M. R. You: Today and tomorrow. Princeton, N.J.: Educ. Test. Serv., 1959. 

36. Kirchner, W. K., Dunnette, M. S., and Mousley, N. Use of the Edwards 
Personal Preference Schedule in the selection of salesmen. Personnel Psychol., 
1960, 13, 421-424. 

37. MacKinnon, D. W. Satisfaction with self as person and in professional role. 
Amer. Psychologist, 1961, 16, 365. 

38. Merton, R. K., Reader, G. G., and Kendall, Patricia L. (Eds.) The student 
physician: Introductory studies in the sociology of medical education. Cambridge, 
Mass.: Harvard Univer. Press, 1957. 

39. Merwin, J. C., and DiVesta, F. J. A study of need theory and career choice. 
J. counsel. Psychol., 1959, 6, 302-308. 

40. Miller, D. C., and Form, W. H. Industrial sociology. New York: Harper & Row, 
1951. 

41. Minor, C. A., and Neel, R. G. The relationship between achievement motive and 
occupational preference. J. counsel. Psychol., 1958, 5, 39-43. 

42. Oakley, C. A., Macrae, A., and Mercer, E. O. Handbook of vocational guidance. 
London: Univer. London Press, 1937. 

43. O'Hara, R. P., and Tiedeman, D. V. Vocational self concept in adolescence. J. 
counsel. Psychol., 1959, 6, 292-301. 

44. Paterson, D. G., Gerken, C. d'A., and Hahn, M. E. Revised Minnesota Occupational 
Rating Scales. Minneapolis: Univer. Minn. Press, 1953. 

45. Rettig, S. Status and job satisfaction of the professional: A factor analysis. Psychol. 
Rep., 1960, 6, 411-413. 




46. Rettig, S., Jacobson, F. N., and Pasamanick, B. The motivational pattern of the 
mental health professional. Psychiat. Res. Rep. (Amer. Psychiat. Assoc.), 1958, No. 1 

47. Roe, Anne. A psychological study of eminent biologists. Psychol. Monogr., 1951, 
65, No. 14. 

48. Roe, Anne. A psychological study of physical scientists. Genet. Psychol. Monogr., 
1951, 43, 121-235. 

49. Roe, Anne. The psychology of occupations. New York: Wiley, 1956. 

50. Roe, Anne. Early determinants of vocational choice. J. counsel. Psychol., 1957, 
4, 212-217. 

51. Schaffer, R. H. Job satisfaction as related to need satisfaction in work. Psychol. 
Monogr., 1953, 67, No. 14. 

52. Segal, S. J. A psychoanalytic analysis of personality factors in vocational choice. 
J. counsel. Psychol., 1961, 8, 202-210. 

53. Siegelman, M., and Peck, R. F. Personality patterns related to occupational roles. 
Genet. Psychol. Monogr., 1960, 61, 291-349. 

54. Small, L. Personality determinants of vocational choice. Psychol. Monogr., 1953, 
67, No. 1. 

55. Stern, G. G., et al. Two scales for the assessment of unconscious motivations for 
teaching. Educ. psychol. Measmt, 1960, 20, 9-29. 

56. Streib, G. F., Thompson, W. E., and Suchman, E. A. The Cornell study of 
occupational retirement. J. soc. Issues, 1958, 14 (2), 3-17. 

57. Strong, E. K., Jr. Vocational interests of men and women. Stanford, Calif.: 
Stanford Univer. Press, 1943. 

58. Super, D. E. A theory of vocational development. Amer. Psychologist, 1953, 8, 
185-190. 

59. Super, D. E. The psychology of careers; an introduction to vocational development. 
New York: Harper & Row, 1957. 

60. Super, D. E. Vocational adjustment in terms of role theory. Voc. Guid. Quart., 
1957, 5, 139-141. 

61. Super, D. E., et al. Vocational development: A framework for research. New York: 
Teach. Coll., Columbia Univer., Bur. Publ., 1957. 

62. Super, D. E., and Bachrach, P. B. Scientific careers and vocational development 
theory: A review, a critique, and some recommendations. New York: Teach. Coll., 
Columbia Univer., Bur. Publ., 1957. 

63. Super, D. E., and Overstreet, Phoebe L. The vocational maturity of ninth grade 
boys. New York: Teach. Coll., Columbia Univer., Bur. Publ., 1960. 

64. Terman, L. M. Scientists and non-scientists in a group of 800 gifted men. 
Psychol. Monogr., 1954, 68, No. 7. 

65. Tiedeman, D. V., and O'Hara, R. P. The Harvard studies in career development: 
In retrospect and in prospect. Harvard Stud. Career Develpm., 1960, No. 15a. 
(Grad. Sch. Educ., Harvard Univer., mimeographed.) 

66. Tuckman, J., and Lorge, I. Retirement and the industrial worker: Prospect and 
reality. New York: Teach. Coll., Columbia Univer., Bur. Publ., 1953. 

67. Tyler, Leona E. Distinctive patterns of likes and dislikes over a twenty-two year 
period. J. counsel. Psychol., 1959, 6, 234-237. 

68. Tyler, Leona E. Toward a workable psychology of individuality. Amer. Psychologist, 
1959, 14, 75-81. 

69. Tyler, Leona E. The work of the counselor. (2nd ed.) New York: 
Appleton-Century-Crofts, 1961. 

70. United States Employment Service. Dictionary of occupational titles. (2nd ed.) 
Vol. 1. Washington: Govt. Printing Office, 1949. (Suppl. 1955.) 

71. Utton, A. C. Recalled parent-child relations as determinants of vocational choice. 
Unpublished doctoral dissertation, Teach. Coll., Columbia Univer., 1960. 

72. Viteles, M. S., Brayfield, A. H., and Tyler, Leona E. Vocational counseling: A 
reappraisal in honor of Donald G. Paterson. Minneapolis: Univer. Minn. Press, 
1961. 

73. Walther, R. H. Self-description and occupational choice. Amer. Psychologist, 1961, 
16, 429. 

74. Welford, A. T., et al. Skill and age; an experimental approach. London: Oxford 
Univer. Press, 1951. 


VII 


Psychology and Other Professions 


18 


Psychology and Education 


In the next three chapters we shall consider certain applications of psychology 
to other professional fields. In these roles, the psychologist works predominantly 
(though not exclusively) through other professional personnel rather than 
directly with clients, applicants, employees, students, or other recipients of his 
services. He may function as a consultant, making recommendations that are 
implemented by others. He may be engaged in the training of professional 
personnel in schools of education, medicine, business, home economics, social 
work, nursing, theology, and other specialties. Finally, he may conduct joint 
research with members of another profession. The interprofessional contacts 
of any one psychologist may of course cover one or more of these three types of 
relationships. 

To be sure, some psychologists employed in schools, courts, hospitals, and 
other institutions do work directly with clients and may even devote most of 
their time to such immediate personal services. When they do so, however, 
their functions are so similar to those of clinical or counseling psychologists as 
to be virtually indistinguishable from them. In the present section, therefore, 
the focus will be on the interprofessional activities of psychologists working 
in these settings. 

Mention should also be made of the extensive contributions of psychologists 
to testing programs for the selection of professional school students, the 
specialty certification of professional personnel, and the evaluation of applicants 
for professional posts. Examples of large-scale programs developed and ad- 
ministered by psychologically trained test specialists are provided by the 
Medical College Admission Test, the Law School Admission Test, and the 
National Teacher Examinations, among others (see 3, pp. 472-485). In these 
programs, the psychologist functions in the same way as he does in any per- 
sonnel selection situation. Since the problems and techniques of personnel 
selection were discussed in Chapters 2, 3, and 4, nothing further need be said 
about this role of psychologists in relation to other professions. 

Although psychologists have established professional relationships with many 
fields (see 146, chs. 8 and 11), three have been singled out for special dis- 
cussion because of their present or potential scope. These are the fields of 
education, medicine, and law, each of which will be examined in a separate 
chapter. Even in these fields, however, the contributions of psychology are 
not so fully developed or so highly structured as they are in the other psycho- 
logical specialties discussed earlier in the book. The roles of psychologists in 
these fields are in a state of flux and will undoubtedly undergo considerable 
reorganization and clarification in the years ahead. 

Psychologists who work in the field of education can be differentiated into 
school psychologists and educational psychologists. Although the two groups 
have much in common in training and functions, their distinctness was 
recognized by the establishment of two separate divisions in the American 
Psychological Association, devoted to these two specialties (see Appendix A). 
Similarly, some universities now offer separate training programs leading to 
graduate degrees in the two fields. The University of Chicago, for example, 
initiated these differentiated programs in 1962. 

The functions of the school psychologist have much in common with those 
of counseling and clinical psychologists but are performed exclusively in a 
school setting. The educational psychologist, on the other hand, is most likely 
to be found on the faculty of a school of education or in an educational re- 
search bureau. As in the distinction between clinical and counseling psychol- 
ogy, that between school psychology and educational psychology is one of 
degree. The focus of activity in the two specialties can be clearly differentiated, 
but there is much overlapping of functions and individual variation. The first 
section of this chapter is concerned with the work of the school psychologist. 
The remaining three sections, dealing with teacher training, educational
research, and the educational applications of teaching machines and other
technological developments, fall within the area of educational psychology.


THE SCHOOL PSYCHOLOGIST 


Functions. The functions and responsibilities of a school psychologist vary 
widely with the school and the community in which he is employed. Never- 
theless, questionnaire surveys, national conferences, and published descriptions 
of these functions reveal certain areas of general agreement (see 29; 48, ch. 1; 
68; 98; 146, ch. 8; 147). The most common function is one of diagnosis, 
broadly defined. This involves individual intelligence and personality testing; 
fact-finding interviews with the pupil, as well as with teachers, parents, and 
others who have significant contacts with him; observation of the pupil in the 
classroom, on the playground, and in other school contexts; and study of the 
pupil's cumulative school record. 

These diagnostic procedures are followed by recommendations for remedial 
action, which are usually carried out by other professional personnel, such as 
the child's classroom teacher or the school social worker. In some cases, the 
psychologist may recommend remedial instruction to make up for special 
educational deficiencies, as in reading or arithmetic. In still other cases, he may 
recommend placement in a special class or special school. Conferences with 
parents may also be utilized in the effort to improve the child’s home situation 
and parental relations. In all these procedures, it is apparent that the school 
psychologist relies heavily upon environmental therapy (see Chapter 14). 

A major source of current controversy regarding the functions of the school 
psychologist pertains to psychotherapy. Some school psychologists maintain that 
they should also conduct psychotherapy with individual children, especially 
when community and other clinical resources are inadequate. On the other 
hand, a majority of spokesmen for school psychology deemphasize this role. 
All agree, of course, that any contact between the psychologist and the child 
may and should serve a therapeutic function. The disagreement centers around 
more intensive, long-term psychotherapy. 

Apart from the various arguments that have been mustered for and against 
psychotherapy by school psychologists, there is the inescapable fact that the 
school psychologist rarely has the time required to carry out this function
unless he concentrates on a few seriously disturbed cases and neglects all other 
obligations. It has been estimated that to provide one school psychologist for 
every three thousand schoolchildren in the nation would require some fifteen 
thousand school psychologists (146, p. 167). The number currently available 
is close to one-tenth of that figure. To be sure, school psychologists are quite 
unevenly distributed among school systems. But even in those systems that are 
most liberally provided with such services, the available number of psychol- 
ogists is not such as to permit extensive individual psychotherapy. When 
school psychologists work in this capacity, they do so chiefly as an auxiliary 
or emergency measure. The bulk of their contribution is made through other 
professional personnel. 
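
The staffing arithmetic in the preceding paragraph can be checked with a short calculation. The Python sketch below is illustrative only: the function names are hypothetical, and the school population of about 45 million is derived here from the two figures quoted in the text rather than stated there.

```python
def implied_pupils(psychologists_needed: int, pupils_per_psychologist: int) -> int:
    """School population implied by a target staffing ratio."""
    return psychologists_needed * pupils_per_psychologist


def pupils_per_available(total_pupils: int, available: int) -> float:
    """Average number of pupils each available psychologist would serve."""
    return total_pupils / available


# Figures quoted in the text (146, p. 167).
PUPILS_PER_PSYCHOLOGIST = 3_000
PSYCHOLOGISTS_NEEDED = 15_000

# "Close to one-tenth of that figure" is taken here as exactly one-tenth.
available = PSYCHOLOGISTS_NEEDED // 10

total = implied_pupils(PSYCHOLOGISTS_NEEDED, PUPILS_PER_PSYCHOLOGIST)
print(total)                                   # 45000000
print(pupils_per_available(total, available))  # 30000.0
```

On these assumptions, each available school psychologist would be responsible for roughly thirty thousand pupils, ten times the recommended load, which is consistent with the conclusion that extensive individual psychotherapy is not feasible.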

Even if the available supply of school psychologists were to be greatly 
augmented, it would still be desirable for them to work principally through 
teachers and others who have daily contact with the children. Such an ap- 
proach is in keeping with the objectives of the mental health movement in its 
emphasis upon prevention and upon the fostering of effective psychological 
development through good teaching practices (1). In school as elsewhere, men- 
tal health is not something to be acquired in a special department or at certain 
hours, but is a by-product of one’s experiences in daily living. Successful school 
achievement, for example, is as important to the mental health of the school- 
child or college student as is successful job performance to the mental health 
of an adult worker.

Depending upon the needs of a particular school system and the availability 
of other personnel, school psychologists may and frequently do perform a 
number of other functions. They may assist in organizing and administering 
the school’s group testing program. Because of their special training, they can 
also be of help in the interpretation of test results and in the evaluation and 
selection of new tests. Frequently they conduct in-service training of teachers 
in such areas as test interpretation, detection of emotional problems in children, 
giving of “psychological first aid” in the classroom, and application of the 
mental health point of view to teaching. They can also make a contribution to 
curriculum development, to the improvement of cumulative records, and to 
various administrative procedures. Often they are called upon to participate in 
PTA meetings, address various civic groups, and perform other public- 
information functions. In some schools, psychologists conduct remedial train- 
ing programs in basic educational skills, although the desirability of their 
doing so has been questioned on the grounds that they can use their time more 
effectively. Special teachers trained in remedial work should carry on this
instruction. Opinion is also divided as to whether school psychologists should
engage in research as a part of their regular duties. For many school psycholo- 
gists this may be an academic question because of time pressures. 

Referral Problems and Sources. The kinds of problems for which 
children are referred to a school psychologist are quite varied. The most visible 
type is the behavior problem of an aggressive or disruptive nature that
interferes with the normal operation of the school. Other kinds of emotional
adjustment problems are also a common cause of referral. Learning difficulties
of a general or specific nature, as in reading or arithmetic, are another frequent 
complaint. Some cases are referred because of physical handicaps that require 
special placement or adjustment in instructional techniques. The psychologist 
may be consulted, too, with regard to gifted children whose educational
programs should be enriched or accelerated to suit their abilities.

Although the classroom teacher is the most common referral source, other 
school personnel may have opportunities to detect problems and to make refer- 
rals or seek the advice of the school psychologist. Among them are the school 
nurse and physician, the social worker, and special teachers. At the high school 
level, self-referrals by students are not unusual. Other ways of identifying 
individuals in need of help include group screening procedures utilizing 
teachers’ ratings or screening guides, as well as checklists or inventories filled 
out by pupils. When such group procedures are followed by screening inter- 
views, they will usually help to detect problem cases not otherwise identified. 
Visits by the school psychologist to classes and to other school activities may 
also bring to his attention pupils who are not adjusting satisfactorily. Cumula- 
tive records and group testing programs can help to identify pupils with learn- 
ing problems and with special talents or defects. 

Training. With regard to training, the school psychologist must cut across 
the fields of psychology and education. Although there are a few dissenting 
voices, it is the general consensus that he is not merely a clinical psychologist 
who works in a school building. He works not only in a school but through 
it. He serves as a consultant to educational personnel, and he utilizes the school 
environment as a therapeutic medium. For these reasons, he needs an under- 
standing of the educational process and some knowledge about the organiza- 
tion and administration of school systems, their available resources, and the 
roles of their specialized personnel. 

Within psychology, he should have a firm foundation in the general science 
of behavior, with special training in such areas as child or developmental 
psychology, personality adjustment, psychological testing, and clinical and 
counseling methods. Whether conducted in a department of psychology, a
department of education, or jointly in both departments, university training
programs for school psychology require the cooperative services of psychology and
education faculty. It is particularly important that in his practicum training the 
school psychologist have adequate opportunity to work with children. Training
in clinical psychology is often limited to adult cases, unless special efforts
are made to include experience with children (see 97; 147, ch. 2). Teaching 
experience has also been recommended, although other types of practicum 
experience in the schools may provide a realistic orientation to the sort of 
environment in which the school psychologist will work. 

Legal Status. As a professional specialty, school psychology has been making
rapid strides since midcentury. In 1948, the APA Division of
School Psychologists had 88 members; in 1963, this number had grown to 856, 
representing nearly a tenfold increase over a 15-year period. Since a sizable 
proportion of APA members are not affiliated with any division and a number 
of school psychologists may not even be members of the APA, it has been 
estimated that the total number of psychologists functioning in this capacity 
may be of the order of 1,500 (146, p. 167). This number, of course, is still far 
below even the most conservative estimates of current need (29). 
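
The growth figures in the preceding paragraph can be verified with a brief calculation. The Python sketch below is illustrative only: the function names are hypothetical, and the compound annual rate is a derived figure that does not appear in the text.

```python
def growth_factor(start: int, end: int) -> float:
    """Ratio of the final count to the initial count."""
    return end / start


def annual_growth_rate(start: int, end: int, years: int) -> float:
    """Compound annual growth rate that turns `start` into `end` over `years`."""
    return (end / start) ** (1 / years) - 1


# Membership of the APA Division of School Psychologists, as given in the text.
MEMBERS_1948 = 88
MEMBERS_1963 = 856
YEARS = 1963 - 1948  # the 15-year period cited

factor = growth_factor(MEMBERS_1948, MEMBERS_1963)
rate = annual_growth_rate(MEMBERS_1948, MEMBERS_1963, YEARS)

print(round(factor, 2))  # 9.73, i.e. "nearly a tenfold increase"
print(round(rate, 2))    # 0.16, i.e. roughly 16 percent per year
```

The factor of about 9.7 agrees with the "nearly tenfold" description; the annual rate assumes steady compound growth, which the text does not claim.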

Approximately half of the states now have certification laws for school
psychologists (63). The educational requirements for certification vary from the
M.A. to the Ph.D. level. Among psychologists themselves, it is recognized that 
school psychologists should have doctoral level training, although the M.A.
level is sufficient for the more routine testing functions performed by school
psychologists in some school systems (29). It is recommended, however, that 
the latter job level be designated by a more restricted job title, such as psy- 
chometrist, psychometric technician, or psychological examiner. 

Relation to Guidance Counselors. In one respect, the functions of a 
school psychologist are more similar to those of clinical psychologists than to 
those of counseling psychologists: the school psychologist does not normally 
provide vocational orientation or educational and vocational counseling. In 
most school systems, these functions are handled by guidance counselors.
Parenthetically, one wonders why such a redundant term was chosen to
describe this function. Since “guidance” and “counseling” are often used
synonymously, calling someone a “guidance counselor” has been likened to
naming a girl “Mary Marie” (146, p. 168). Having become deeply entrenched
in educational nomenclature, however, this term will undoubtedly persist. 

Guidance counselors are rarely trained as counseling psychologists. Training 
requirements for this position vary widely from state to state and are often 
quite limited. In some states, in fact, guidance counselors may be classroom 
teachers with no formal preparation or special experience in counseling func- 
tions. To help remedy some of these limitations in training, the National 
Defense Education Act of 1958 appropriated funds for short intensive training 
institutes for school guidance counselors (140). Many such institutes have now 
been conducted throughout the country. Whatever his level of training and 
sophistication, the guidance counselor represents one more type of specialized 
educational personnel with whom the school psychologist needs to maintain 
effective working relations (see 147, pp. 257-260). 


PSYCHOLOGY IN TEACHER TRAINING 


Objectives. Courses in educational psychology are designed to augment 
the teacher's understanding of human behavior, rather than to provide specific 
“rules” for effective teaching. Psychological training should make available to 
prospective teachers knowledge about child behavior, learning processes, ad- 
justment mechanisms, and other relevant areas. Such information should help 
the teacher to make a wiser choice of teaching procedures when confronted 
with specific situations. As in other fields of application, psychological training
should provide the teacher with a point of view toward human behavior and a 
method for solving interpersonal problems as they arise. It should help him to 
decide what to look for and what questions to ask in trying to understand 
situations encountered in the classroom.

Such an approach is in sharp contrast to the learning of prescribed teaching 
techniques. There are many reasons why a set of rules or techniques cannot 
serve as a satisfactory guide to effective teaching. Teaching situations are
complex, and each is unique in its totality. Moreover, teaching procedures 
very probably interact with teacher variables, as well as with learner and task 
variables. A procedure effective in the hands of one teacher may fail dismally 
when attempted by another. Since teaching is basically an interpersonal rela- 
tionship, there is good reason to expect the same sort of interaction between 
teaching method and teacher characteristics that is found between therapeutic 
techniques and therapist variables in clinical psychology (see Chapter 15). 


Classroom atmosphere depends largely upon the teacher. If the teacher feels 
insincere, hypocritical, or uncomfortable with the procedures he employs, the 
classroom atmosphere suffers as a consequence. Recognizing the importance 
of these conditions for teacher effectiveness, Rogers writes: 


To know when he has reached the limit of his own internal comfort, and to feel 
easy in refusing to function in ways that are not comfortable for him, is another 
aspect of the genuineness of his attitudes. If he behaves in some way that is not 
natural for him, simply because he feels that he should do so, this is quickly sensed 
by the group and damages the group atmosphere (115, p. 401). 


Still another type of interpersonal relation that may be recalled in this con- 
nection is that between supervisor and employees, discussed in Chapter 6. 
There, too, no set of rules could be universally applied, because of the inter- 
actions between supervisor, worker, and situational variables. Also relevant 
are the findings on leadership climate, considered in Chapters 5 and 6. As in 
the case of industrial supervision, the leadership climate in which teachers 
themselves must function is likely to influence their classroom behavior. If 
permissive and pupil-centered teaching procedures have been authoritatively 
even though subtly imposed upon a teacher by his own supervisor, their 
application can hardly be genuine and whole-hearted. 

The Psychology of Learning. Although emphasis on different topics has 
shifted from time to time, most educational psychology texts today focus on 
learning processes (see, e.g., 26, 37, 74, 78, 84, 107). It is obvious that the 
effective management of learning is the central problem of education. To be 
sure, information from several other areas of psychology is relevant; but the 
value of such information to the educator stems chiefly from its application to 
the facilitation and guidance of learning. It should be noted, too, that learning 
covers much more than what is formally taught in school. The pupil, whether 
he be academically successful or unsuccessful, is constantly learning many 
things, such as attitudes toward peers and authority figures, habits of fair play 
and sportsmanship, neurotic symptoms, decision-making techniques, prejudices 
and superstitions, and a host of other reactions that may be desirable or
undesirable. The teacher needs to be alerted to the vast multiplicity of learning
that goes on in the school situation. An understanding of learning processes 
is thus relevant to every aspect of education, from the fostering of mental 
health to the choice of specific methods for teaching reading or arithmetic. 

The potential contribution that psychology can make to education is still 
largely unrealized. Oddly enough, the gap between the two fields is widest 
in the area of learning, where the greatest contribution would be expected 
(see 32, 46, 55, 91, 126). From time to time, psychologists have attempted
to survey what psychology has to offer to education (e.g., 11, 130). To a large 
extent, every educational psychology text tries also to do this. Insofar as these 
books undertake to give a coordinated picture of recommended teaching
practices, however, they fill in extensively from personal opinions of educational
leaders, educational folklore, and other experimentally unverified sources. 

Another regrettable tendency in educational psychology is that of drawing 
unwarranted conclusions and overgeneralizing from isolated laboratory experi- 
ments. Most laboratory investigations of both animal and human learning, for 
example, were designed to contribute toward the development of learning 
theory. When examined in isolation and outside of this context, they often 
appear trivial. It is only when they are considered in relation to other
experiments and within their appropriate theoretical framework that their
significance can be recognized. Certainly such experiments were not designed to
test (or support!) the effectiveness of specific teaching procedures. What is
needed for the latter purpose is applied research on school learning, similar to 
the research on military training problems initiated during World War II 
(see Chapter 5). 

There is relatively little well-designed learning research that is explicitly
oriented toward education. The reasons for this deficit can be found partly
in psychology and partly in education (see 91, 138). The confusion between 
basic and applied research in this field has also served to delay progress. 
Melton (91) has pointed out that education bears the same relation to psy- 
chology that engineering bears to the physical sciences. Bridging the gap be- 
tween education and psychology, the educational psychologist must design the 
curricula and teaching procedures that utilize the science of learning and 
must then carry out the necessary applied research to test the effectiveness 
of these procedures in school situations. 

To be sure, there is a mass of published educational research on such topics 
as reading and arithmetic and on comparative evaluations of different methods 
of teaching these subjects (see 18, 49). The results are often inconclusive, 
however, owing to inadequacies in experimental design, control of conditions, 
or statistical analysis of data. Because of the many variables that operate in a 
classroom situation and because the relative effectiveness of teaching methods 
may vary with learner characteristics and with the objectives of instruction, 
complex experimental designs are needed for a conclusive evaluation of teach- 
ing methods. 

A knowledge of learning theory and of laboratory experiments on learning 
can contribute in at least two ways to the development of an empirically based 
educational technology. First, it can provide methodological guidance in the 
planning and execution of applied research in educational contexts. Second, 
the learning principles identified in the laboratory can lead to a deeper under- 
standing of school learning. Out of such understanding may come suggestions 
for specific educational procedures to be evaluated through applied classroom 
research. The learning principles cited in Chapter 5 in connection with indus- 
trial and military training are also relevant to school learning. They pertain 
to such questions as motivation, stress and anxiety, active learner participation, 
feedback and immediacy of reinforcement, and the distribution of learning, 
among others.

Since education is expected to equip the individual to meet effectively situ- 
ations encountered in later life, transfer of training is of primary concern to 
the educator. Of particular interest are the conditions that facilitate transfer, 
such as the presentation of the learning task in a variety of contexts and the 
explicit formulation of concepts and principles. There is some evidence
suggesting that principles worked out by the learner himself transfer more readily
to other situations than do principles that are given to him (58, 73, 127). The 
optimal combination of guidance and empirical discovery, however, undoubted- 
ly varies with task variables, learner variables, and learning objectives (41, 73, 
77, 109).

Education deals largely with the learning of meaningful verbal materials, as 
contrasted with motor learning or rote verbal learning. There has recently 
been an upsurge of research, both basic and applied, on the nature of
meaningfulness, its effect on learning and transfer, and ways of increasing its
utilization in the learning process (5, 6, 22, 43, 100, 142, 143). Also relevant are
studies on concept formation in children as a function of task and learner 
variables (e.g., 101, 102). 

The importance of motivation in the learning process is generally recog- 
nized. The need for complex experimental designs that permit the assessment 
of interaction effects is well illustrated by a recent Swedish investigation of 
the effects of praise and blame on schoolchildren's achievement in arithmetic 
(69). In this experiment, the relative influence of praise and blame was found 
to vary with the child's initial performance level, his anxiety level, his
attitudes toward the teacher, and the type of task (whether mechanical
computation or problem solving).

Still another major concern of educational psychologists is represented by 
the concept of learning readiness. Essentially, “readiness” refers to the
presence of prerequisite skills and knowledge that enable the learner to profit
maximally from a certain kind of instruction. Individual differences in the 
reading readiness of first-grade schoolchildren provide a familiar example. At 
one time, readiness was considered largely in terms of maturation. To be sure, 
the attainment of certain minimum physical prerequisites facilitates some 
kinds of learning. Unless he can make the necessary auditory discriminations, 
the child cannot learn to speak by the usual procedures; without the ability 
for fine motor coordination, he cannot manipulate a pencil in writing. Most 
school learning, however, is not so closely bound up with sensorimotor
development. In the mastery of educational tasks, the importance of prior
learning has come to be increasingly recognized (12). More and more emphasis
is being placed upon the hierarchical development of knowledges and skills, 
whereby the acquisition of simpler concepts equips the individual for the 
learning of more complex concepts at any age. 

Of particular interest in connection with readiness is a series of experi- 
ments by Harlow (57) on learning sets. These experiments, conducted on 
both monkeys and children, demonstrated that, when learning a series of 
similar problems, the individual gradually establishes certain learning sets 
that facilitate the learning of the required discriminations. Through such an 
experience, the individual is thus “learning how to learn.” This concept is
especially relevant to education, so much of which is designed to equip the 
individual for later learning. Through this approach, readiness can be seen 
as a special application of transfer of training, since it reflects the facilitating 
effect of prior learning upon present learning. In human learning, the
acquisition of effective learning procedures can be accelerated by guidance
from the teacher. “How to study” programs are aimed at the same objective.

Since midcentury, there is evidence that the gap between laboratory
research on learning and educational technology is narrowing. Psychologists
and educators, acting both as individuals and through their professional
associations, have been making special efforts to bring about closer contacts
between the two fields. Books are beginning to appear that interpret learning
theory for educators and concentrate on the implications of learning
principles for school instruction (e.g., 50). The rising interest in education as
a factor in national defense and the growing demand for scientists and other
highly trained personnel have led to increased government support of
educational research. All this activity is beginning to bear fruit in some
comprehensive and well-designed research projects on school learning. Eventually,
the results of these studies should provide a more solid foundation for the 
teaching of educational psychology. 

Other Areas of Psychology. Prospective teachers generally receive prep- 
aration in certain other areas of psychology, either in separate courses 
or as part of a comprehensive course in educational psychology. Develop- 
mental, child, or adolescent psychology is designed to orient teachers toward 
developmental changes in behavior and to familiarize them with the particu- 
lar age groups they expect to teach. A closely related area is that of individual 
and group differences. Knowledge regarding the nature, origins, and extent 
of individual differences helps in understanding the behavior of individual 
children in the classroom. Educators also need to consider the implications
of physical and psychological sex differences and of the sex roles institution- 
alized by a particular culture (2, ch. 14; 79; 96). In recent years, too, there 
has been a growing concern with the educational problems of children from 
culturally deprived backgrounds. In many parts of the country, a number of 
special remedial and developmental programs have been instituted for such 
children, ranging from the preschool to the high school level. 

The teacher also needs to be acquainted with the special problems and 
educational needs of several types of “exceptional children,” including the 
intellectually gifted (38, 61, 145), the creative (45, 131), the mentally re- 
tarded, the physically handicapped, and the emotionally disturbed (56, 72, 
75, 76, 117). Many school systems have special classes for children in these 
various categories, taught by specially qualified teachers. Such teachers are 
usually required to have more training in psychology than are regular class- 
room teachers. 

To function effectively in the modern school, the teacher must have con- 
siderable familiarity with psychological testing, with special reference to 
group intelligence tests and achievement tests. An elementary knowledge of 
the nature and interpretation of test scores and simple statistical procedures 
for handling such scores is also helpful. Techniques for developing and 
evaluating classroom examinations are generally taught in this connection 
too. Several current texts on measurement and evaluation have been specially 
written for use in teacher training (e.g., 44, 134). 

Modern education has for some time included among its major goals the 
development of a healthy personality and the prevention of emotional mal- 
adjustment. The current mental health movement has stimulated interest in 
a reexamination of these goals (1; 33; 65; 66, part 6; 82; 121, pp. 552-566). 
In its report, the Joint Commission on Mental Illness and Health (see Chap- 
ter 14) devoted a separate volume to the role of the schools in mental health 
(1). It is generally recognized that teachers need an elementary introduction 
to the psychology of adjustment, including common adjustment mechanisms 
and the nature and sources of mild behavioral abnormalities. This back- 
ground will help them in identifying children for referral to special services 
as well as in administering “psychological first aid” in the classroom. It should
also assist them in establishing effective teacher-pupil relations and in main- 
taining a classroom atmosphere conducive to mental health. 

A specific example of psychological research that is pertinent to teacher 
training in mental health is the comprehensive investigation of the causes, 
nature, and effects of anxiety among schoolchildren, conducted by Sarason
and his associates (119). Of all the emotional problems of the schoolchild, 
anxiety is one of the least likely to be recognized by teachers, because of 
its minimal observable manifestations. This is particularly true of the bright 
anxious child who does acceptable or even superior school work, although he 
could do still better if he were free from his disruptive anxieties. 

Finally, there is a growing tendency to include some social psychology in
teacher training (21, 139). Examples of research areas in social psychology 
that are relevant to education include leadership, attitudes and prejudice, 
intergroup relations, competition and cooperation, interaction of individuals 
in face-to-face groups, conformity, internalization of social norms, and
sociometry.

The term “educational research” has many meanings (7, 8, 137). Since
the educational profession has its roots in such diverse fields as philosophy, 
history, economics, government, and psychology, its concept of research re- 
flects the variety of practices traditionally designated as research in these 
fields. Thus what the educator calls research includes many activities that 
fall outside the realm of research as defined in psychology and other natural 
sciences. 

Another source of confusion stems from the early identification of
educational research in America with the compilation of educational statistics.
Under this heading, masses of data are regularly gathered on such topics as 
the supply and demand of teachers, salaries of teachers in different parts 
of the country, distribution of pupils in schools and classes, attendance, 
and other administrative details (39, 138). It was for such record-keeping 
purposes that the Office of Education was originally established by the Federal 
government. The same objectives led to the organization of research bureaus 
and the appointment of research directors in individual school systems. An- 
other area that has often been loosely included under educational research 
is curriculum development. Even when empirical outcome studies were con- 
ducted in school settings, they were often undertaken to justify a particular 
educational program to the public rather than to test hypotheses. Such studies 
usually lacked adequate control data and their experimental designs did not 
permit the isolation of the many variables involved.

Against this background, psychological research made little headway. After
its promising beginnings early in the century through the efforts of such 
pioneers as E. L. Thorndike at Columbia and C. H. Judd at Chicago, edu- 
cational psychology made slow progress for nearly half a century. It was 
not until the late 1950s that research in this field began to show new vigor, 
as noted in the preceding section. Even now, much of the research in educa- 
tional psychology is carried on by investigators who are not themselves con- 
nected with schools. Although some of the larger school systems are becom- 
ing more receptive to such outside investigators, there are still many barriers 
to the effective utilization of schools for research purposes (34, 92).

Within school systems themselves, there are as yet few research bureaus 
conducting well-designed psychological research. In some of the largest city
school systems, however, administrative research and curricular research have
become differentiated from educational research. These three types of re- 
search may be conducted by different bureaus, as in the New York City sys- 
tem. Under these conditions, the educational research bureau can concentrate 
its activities on test construction and evaluation, learning and teaching re- 
search, and broader educational experiments.

In this section we shall look at some examples of educational research in 
which psychologists are making significant contributions. In order to give 
some notion of the range of potential contribution, specific examples have 
been chosen from widely diverse areas. Some represent highly developed 
fields of research; others concern problems on which research has barely 
begun. In their approach, the investigations fall at various points along the 
continuum of basic and applied research. Only a few examples are reported 
for purposes of illustration. Many more types of studies could be cited, in- 
cluding some of those mentioned in the preceding section.

Evaluation and Measurement. Traditionally, educational psychologists 
have made their greatest contribution in test construction and in the develop- 
ment of statistical methods for analyzing test results. In this area, educational 
psychologists have also attained a relatively high level of technical sophisti- 
cation. The content of such publications as the Journal of Educational Psy- 
chology and Educational and Psychological Measurement, for instance, re- 
veals both the predominant concern of educational psychologists with testing 
problems and the soundness of their procedures in this area. Besides con- 
tributing to the development and validation of tests, psychologists have con- 
ducted considerable applied and basic research on a number of factors that 
affect test performance. For instance, several studies have been concerned 
with the influence of practice and coaching on test scores (see 3, pp. 54-57). 
Another problem, with broad implications for educational practice, is that of 
“test anxiety.” In a series of studies on both school children and college
students, extensive data were gathered on the origins of this anxiety, its 
effects on test performance and on school achievement, and its relation to 
anxiety in other situations (119).

Not only do schools employ nearly every available kind of psychological 
test in their operational programs, but they are also the principal consumers 
of certain types of tests specially devised for educational purposes. In the con- 
struction of the latter tests, psychologists and educators work in close col- 
laboration. The effective interprofessional relations that have been established 
in test construction are in sharp contrast to those characteristic of learning 
research.

One category of tests developed to meet a specific educational need is 
that of readiness tests (3, ch. 17; 44). These tests are designed to implement 
the previously discussed concept of educational readiness in various areas.
One of their chief uses is in assessing the child's qualifications for school- 
work upon school entrance. Readiness tests for this level have much in 
common with intelligence tests for children in the primary grades. In the 
readiness tests, however, special emphasis is placed upon those abilities found 
to be most important in learning to read; some attention is also given to the 
prerequisites of numerical thinking and to the sensorimotor control required
in learning to write. Among the specific functions covered are visual and 
auditory discrimination, motor control, verbal comprehension, vocabulary,
quantitative concepts, and general information. Figure 113 shows sample
items from two of the tests included in the Metropolitan Readiness Tests 
(62). Consisting of six tests, this battery yields separate scores in reading 
readiness, number readiness, and total readiness for first-grade schoolwork.¹

1 A revision of these tests is in progress.


Fig. 113 Sample items from Metropolitan Readiness Tests. (Reproduced by
permission of Harcourt, Brace & World.)

Test 1. Word Meaning. In the first row, the subject marks the baby; in the
second row, the house.

Test 4. Matching. In each row, the subject marks the picture which is
identical to the one in the circular frame.


At higher educational levels, readiness tests are commonly known as "prog-
nostic tests," but their objectives are essentially the same. By measuring pre-
requisite skills and knowledge, these tests attempt to predict how well the
individual will perform in a particular course of study. Frequently they in-
clude the learning of sample tasks similar to those that will be encountered
in the course. Among the most widely used prognostic tests are those de-
signed to predict achievement in mathematics, ranging from primary-grade
arithmetic to high school algebra and geometry. Still another type of readi- 
ness test is illustrated by the Modern Language Aptitude Test (19). Designed 
to assess the capacity of an English-speaking student for learning any for-
eign language, this test utilizes both paper-and-pencil and tape-recorded
materials. It is suitable for high school, college, and adult groups. Two of its 
subtests require the learning of orally presented numbers and visually pre- 
sented words in an artificial language. The other three parts test the indi- 
vidual's sensitivity to English grammatical structure, as well as certain word 
recognition skills with visual and auditory materials. 

Of all tests used in school settings, the most numerous are the achievement
tests (3, chs. 16-17; 44, chs. 8 and 15-25). The principal purpose of these 
tests is to measure the effects of a course of instruction. Some are general 
achievement batteries, designed to assess the pupil's over-all progress in school- 
work. Others test for mastery of specific subjects, such as algebra, biology, 
modern history, or Spanish. In keeping with modern curricular trends, many 
achievement tests today measure the attainment of relatively broad educa- 
tional goals that cut across different subject-matter areas. 

It will be recalled that the first step in developing a personnel selection 
test is a job analysis (Chapter 2). Similarly, the preparation of an achieve- 
ment test should begin with a specification of the educational objectives 
which the course of study was designed to achieve. A comprehensive anal- 
ysis of general educational objectives was undertaken by a committee of 
test specialists and educators. Three principal domains were recognized in 
these objectives, namely, cognitive, affective, and psychomotor. The results 
within the cognitive domain are reported in the first handbook of the Tax- 
onomy of Educational Objectives (13). The major categories in this domain 
include knowledge (in the sense of remembered facts, terms, methods, prin-
ciples, etc.), comprehension, application, analysis, synthesis, and evaluation.
Each objective is further subdivided into finer and finer categories and illus- 
trated from a variety of fields. 

The specifications drawn up in planning an achievement test should show 
the topics to be covered, the kinds of learning to be tested (in terms of ob- 
jectives), and the relative importance of individual topics and objectives. 
On this basis, the number of items of each kind to be prepared on each 
topic can be determined. In the preparation of test specifications, as well as 
in item writing, psychologists work in close collaboration with “subject- 
matter specialists,” who are usually teachers at the appropriate levels. 
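
The step from specifications to item counts can be illustrated with a short sketch. It is offered only as one plausible scheme, not as the procedure followed by any particular test publisher. The function name allocate_items, the sample topics and objectives, and the rule that a cell's weight is the product of its topic weight and its objective weight are all assumptions made for the illustration.

```python
def allocate_items(total_items, topic_weights, objective_weights):
    """Distribute items over topic-by-objective cells in proportion to weight.

    Assumption: a cell's importance is the product of its topic weight and
    its objective weight. Rounding uses the largest-remainder rule so that
    the cell counts always add up to total_items.
    """
    cells = {
        (topic, objective): t_weight * o_weight
        for topic, t_weight in topic_weights.items()
        for objective, o_weight in objective_weights.items()
    }
    total_weight = sum(cells.values())
    exact = {cell: total_items * w / total_weight for cell, w in cells.items()}
    counts = {cell: int(share) for cell, share in exact.items()}  # round down

    # Hand the items lost to rounding to the cells with the largest remainders.
    leftover = total_items - sum(counts.values())
    by_remainder = sorted(exact, key=lambda c: exact[c] - counts[c], reverse=True)
    for cell in by_remainder[:leftover]:
        counts[cell] += 1
    return counts


if __name__ == "__main__":
    # Hypothetical specifications for a short arithmetic test.
    blueprint = allocate_items(
        total_items=10,
        topic_weights={"fractions": 3, "decimals": 1},
        objective_weights={"knowledge": 1, "application": 1},
    )
    for (topic, objective), n in blueprint.items():
        print(f"{topic:10s} {objective:12s} {n}")
```

In practice the weights themselves would come from the judgment of the subject-matter specialists; the sketch shows only how such weights, once agreed upon, determine the number of items to be written for each cell.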

An outstanding example of an achievement test battery is the Sequential 
Tests of Educational Progress (STEP), developed by the Cooperative Test 
Division of the Educational Testing Service (23). These tests are available 
at several levels extending from the fourth grade of elementary school to the 
sophomore year of college. At each level there are seven tests, including 
multiple-choice tests in Reading, Writing, Mathematics, Science, Social Stud- 
ies, and Listening, as well as an Essay test. Although the need for specific 
knowledge in particular fields was recognized in constructing STEP items, 
major emphasis was placed upon the application of learned skills to the solu- 
tion of new problems. The tests are concerned more with the outcome of 
school learning than with its content. The STEP item writers displayed un- 
usual ingenuity and skill in composing items that do in fact measure many
of the intellectual skills outlined in the Taxonomy of Educational Objectives.
Two examples typical of items found in the tests for Science (grades 4 to 6) 
and for Social Studies (grades 10 to 12) are reproduced in Figures 114 and 
115, respectively. 

Fig. 114 Sample item from STEP Science Test for Grades 4 to 6.
(Reproduced by permission of Cooperative Test Division, Educational
Testing Service.)

Situation: Tom wanted to learn which of three types of
soil (clay, sand, or loam) would be best for growing lima
beans. He found three flowerpots, put a different type of
soil in each pot, and planted lima beans in each. He placed
them side by side on the window sill and gave each pot the
same amount of water.

[Illustration: three flowerpots labeled LOAM, CLAY, and SAND.]

The lima beans grew best in the loam. Why did Mr.
Jackson say Tom's experiment was NOT a good experi-
ment and did NOT prove that loam was the best soil for
plant growth?

23 A The plants in one pot got more sunlight than the
     plants in the other pots.
   B The amount of soil in each pot was not the same.
   C One pot should have been placed in the dark.
   D Tom should have used three kinds of seeds.


Fig. 115 Sample items from STEP Social Studies Test for Grades 10 to 12.
(Reproduced by permission of Cooperative Test Division, Educational
Testing Service.)

The students are provided with monthly temperature and rainfall
charts for four places, as shown below.

[Charts for Cities I to IV: degrees of temperature plotted against inches
of rainfall, by month.]

They are asked such questions as:

15 In which city would one need the greatest variety of weights of clothing?
   A I    B II    C III    D IV
18 Which of these cities are north of the equator?
   E I and III only    G All of the cities
   F II and IV only    H None of the cities

This is one of several item groups designed to test understanding of “the
profound effects of geographic environment on man's institutions and ways
of living,” one of the broad objectives around which social science items
were constructed. Skill in chart reading is likewise tested by this item.


Ideational Problem Solving. With the renewed interest in research on
school learning, educational psychologists are turning more and more to ex-
perimental analyses of problem-solving processes. Ideational problems involv-
ing verbal, mathematical, and other symbolic content are particularly relevant
to school instruction. Some of the pertinent research has been conducted
outside of educational contexts, although there is an increasing number of
studies conducted under educational auspices. A variety of approaches has
been followed by these investigators. Some have employed factor analysis in
the effort to identify the abilities utilized in solving different kinds of prob-
lems (e.g., 92). A growing number of studies are concentrating upon the
problem-solving process itself, rather than considering only the solutions (o, 
14, 16, 70, 71, 110-112). 

One approach followed in a series of investigations utilizes the questions 
subjects ask in solving a problem, as a means of studying their problem-
solving procedures (70, 110, 111). To solve each problem, the subject must
elicit certain items of information by asking appropriate questions. The num- 
ber and nature of these questions, as well as the sequence in which the sub- 
ject asks them, are analyzed in various ways to evaluate the effectiveness of 
the individual's problem-solving approach and to identify different problem- 
solving techniques. In some problems, the subject chooses among a set of 
given questions; in others, he generates his own questions. 

This procedure has also been employed in an exploratory study of the 
effects of training upon the problem-solving performance of high school and 
college students (112). Training consisted in solving a series of problems of 
a verbal and spatial nature, the training sessions being distributed over one 
semester. As the subject worked on the problems, the experimenter helped 
him to analyze his own problem-solving approach and to become aware of 
the methods he used, without, however, suggesting any specific problem- 
solving procedures. Comparison of matched experimental and control groups 
showed significantly greater improvement by the experimental group in solv- 
ing similar test problems. The experimental group also made larger gains in 
mathematics grades than did the control group, suggesting that the training 
may have transferred to achievement in this subject. A similar approach is 
currently being followed in a more direct investigation of the performance of 
high school students in mathematics (113). Groups of students selected be- 
cause of high and low mathematical ability, respectively, are being compared 
in their methods of solving mathematical problems. This type of study is 
especially appropriate today, when educators are engaged in the development 
of new mathematics curricula and teaching materials. 

Creativity. In Chapter 3, reference was made to the distinction between 
convergent and divergent thinking, proposed by Guilford (51). Essentially, 
convergent thinking leads to a single right answer determined by the given 
facts. Divergent thinking, on the other hand, “goes off in different directions” 
and leads to a diversity of novel and original answers. This distinction is 
useful in differentiating between the type of problem solving discussed in the 
preceding section and the creative activities to be considered in this section. 
In both psychology and education, there has been an upsurge of interest in 
the investigation of creative processes, the identification of creative indi- 
viduals, and the development and fostering of creative abilities. Such interest 
is evidenced by major conferences and symposia organized around these ques-
tions (e.g., 4, 132), as well as by several comprehensive research projects
(see 133, 135, 136). 

Some preliminary results obtained in creativity research on adults were 
given in Chapter 3. In that connection, examples were also cited from various 
tests developed by Guilford and his associates (51-54). These tests were de- 
signed to measure the three principal classes of factors identified by Guilford 
and his co-workers in their factorial analyses of creative tasks, namely, flu-
ency, flexibility, and originality (see Chapter 3). Other investigators have
subsequently adapted some of the Guilford tests for use with younger sub-
jects and have employed them, along with new tests, in educational research
projects extending from the preschool to the high school level (42, 45, 103, 
133, 135, 136). An example of a test that is effective with young children 
is the "Improvements" test, in which the child is given toys — such as nurse 
kit, fire truck, and dog — and is asked to think of ways for changing each 
toy so it would be "more fun to play with" (135, 136). Thinking up un- 
usual uses for common objects, such as a toy, a tin can, or a brick, illustrates 
another technique that has proved applicable over a wide age range (45, 53, 
135, 136). 

A common finding in these investigations is that creativity involves many 
intellectual factors not covered by traditional intelligence tests. In unselected 
populations, the correlations between creativity tests and intelligence tests 
fall largely between .20 and .40; in relatively homogeneous samples of indi-
viduals at high intellectual levels, the correlations may be zero or even nega-
tive (133). These relationships were illustrated in a study of high school
students conducted by Getzels and Jackson (45). From a population of 449 
high school students attending the University of Chicago laboratory school, 
two groups were selected for intensive study: one high in intelligence but 
not in creativity, the other high in creativity but not in intelligence. The 
High Intelligence group consisted of 28 students who were in the top 20 
per cent in IQ² when compared with students of the same age and sex,
but were below the top 20 per cent on a battery of five creativity tests; 
the High Creativity group consisted of 26 students falling in the top 20 per 
cent on the creativity battery but below the top 20 per cent in IQ. 
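
The selection rule just described can be sketched in a few lines. The sketch is an illustration under simplifying assumptions, not a reconstruction of the investigators' actual procedure: it ranks all students together rather than within age and sex groups, it treats the creativity battery as a single score, and the function names and the ten student records are hypothetical.

```python
def top_fraction_cutoff(scores, fraction=0.20):
    """Return the lowest score that still falls in the top `fraction`."""
    ranked = sorted(scores, reverse=True)
    n_top = max(1, int(len(ranked) * fraction))
    return ranked[n_top - 1]


def select_groups(students, fraction=0.20):
    """Split students into the two contrasting groups.

    High Intelligence: top fraction in IQ but not in creativity.
    High Creativity:   top fraction in creativity but not in IQ.
    Students high on both measures, or on neither, belong to neither group.
    """
    iq_cut = top_fraction_cutoff([s["iq"] for s in students], fraction)
    cr_cut = top_fraction_cutoff([s["creativity"] for s in students], fraction)
    high_iq = [s["name"] for s in students
               if s["iq"] >= iq_cut and s["creativity"] < cr_cut]
    high_cr = [s["name"] for s in students
               if s["creativity"] >= cr_cut and s["iq"] < iq_cut]
    return high_iq, high_cr


if __name__ == "__main__":
    # Hypothetical scores, invented solely to exercise the rule.
    iqs = [140, 135, 120, 118, 115, 112, 110, 108, 105, 100]
    creativity = [90, 50, 95, 60, 55, 52, 48, 45, 40, 38]
    students = [
        {"name": name, "iq": iq, "creativity": cr}
        for name, iq, cr in zip("ABCDEFGHIJ", iqs, creativity)
    ]
    print(select_groups(students))
```

Note that a student who is high on both measures is excluded from both groups. The design deliberately contrasts students who are high on one measure only.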

In school achievement, both the High Creativity and the High Intelligence 
groups excelled the total school population by the same amount. Since the 
High Creativity group did not differ significantly in IQ from the total school 
population, such students would traditionally be regarded as “overachievers.” 
These findings highlight the artificiality of the overachievement concept, as 
well as its dependence upon the particular measure of scholastic aptitude 
used to predict achievement level. Getzels and Jackson employed several 
other rating and testing techniques in their comparison of the two groups. 
In general, the High Creativity students tended to express unconventional 
values and aspirations, to place a high value on humor, and to utilize unusual 
associations in their fantasy productions. 

All these findings were closely corroborated in an extensive investigation 
of elementary schoolchildren conducted by Torrance (135, 136). Achieve- 
ment tests again revealed no differences between the High Intelligence and 
High Creativity groups. Nevertheless teachers rated the High Creativity 
cases as less desirable pupils, less ambitious, and less hardworking than the 
High Intelligence cases. Humor and playfulness were again conspicuous in 
the creative child's behavior. Other parts of Torrance’s continuing project are 
concerned with such problems as the creative pupil's estrangement from his 
peers, influences in and out of school that tend to interfere with the develop- 
ment of creative behavior, and the application of certain techniques of cre- 
ativity training to schoolchildren (132, 135, 136). 

With regard to the relation between intelligence test performance and

2 As determined from Stanford-Binet, WISC, or Henmon-Nelson group tests of intelli-
gence.

study was described as emphasizing structure, order, and planning, while
exhibiting warm interpersonal relations and sensitivity to children's feelings.
The authors point out that the high work-orientation and planfulness of
this type of teacher is in sharp contrast to some interpretations of “permis-
siveness.” The description of these effective teachers recalls at least two
dimensions identified in the Teacher Characteristics Study, namely, “warm,
understanding, friendly” and “responsible, businesslike, systematic.” There
are also some interesting parallels with the dimensions of supervisory be-
havior discussed in Chapter 6. It will be recalled that the most effective
supervisors were those who rated high in both the “Consideration” and
“Structure” dimensions. Since a major part of the teacher's role consists in
serving as a group leader or supervisor, it is reasonable to expect such simi-
larities.

Teaching Media. An increasing number of experimental psychologists 
who had been engaged in military research on training media have been
turning their attention to educational problems. Evidence of such interest
may be found in the reports of several conferences organized to survey the 
educational implications of available training research (36, 47, 99). Much of 
this research centers around training media (see also Chapter 5). In the 
broadest sense, such media include any method for communicating information 
or providing an opportunity to learn. Among such media should be listed text- 
books, instructors, classmates, and even solitary intellectual activity (36, pp. 
196-206). 

Particular attention has been given to so-called audio-visual media, involving 
the use of motion-picture films, tape recordings, sound filmstrips, and tele- 
vision. Such media have been used in at least three ways. First, they may serve 
as teaching aids (along with charts, mock-ups, demonstrations, and similar 
materials) to supplement conventional teaching methods. Second, they may 
carry the major instructional load in special educational programs for such 
groups as the physically handicapped, hospitalized patients, prisoners, or adults 
who find it impracticable to attend classes because of distance, lack of time, 
domestic responsibilities, or other reasons. Finally, they may constitute the 
primary instructional media in regular school classes. Obviously it is this third 
application that has the broadest implications for the future of education. 

Research on audio-visual media may be subdivided into evaluative studies of 
specific instruments and experimental investigations of variables that influence 
learning. The first type is concerned with effectiveness of a single film, tele- 
vision course, or other instrument within a particular context. Typical of the 
earlier research in this field, these evaluative studies sought to answer such 
questions as "Can students learn through television?" or “How does the amount 
learned from this film compare with the amount learned from conventional 
instruction?” Because the many variables involved in the methods under in- 
vestigation are not isolated, it is impossible to generalize the findings of such 
evaluative studies beyond the situations investigated. For example, the results 
would be restricted to a particular film as compared to a particular instructor 
using face-to-face communication. 

A more recent development, dating largely from World War II, is the 
systematic and controlled investigation of individual variables in relation to 
the effectiveness of audio-visual training media. Although the bulk of this 
research was conducted in military contexts, studies of school learning are
gradually appearing. Most available investigations deal with films, but their
findings usually apply equally well to television. Among the many variables 
investigated are different ways of providing active student responses in the 
course of learning and the relative effectiveness of these procedures (85, 86). 
Overt responses have also been compared with covert, or mental, responses. 
Different types of feedback, reinforcement, and knowledge of results have like- 
wise been investigated. Other factors include such stimulus characteristics as 
color, animation (e.g., moving arrows, pop-on labels), and difficulty level of
accompanying oral commentary as measured by Flesch formulas or other read- 
ability indices. Still other studies have been concerned with repetition, review, 
rest pauses, organization of learning units within a film, and techniques for 
calling attention to certain aspects of the film, as through advance instructions 
and preliminary or interpolated tests. 

Some attention has also been given to the instructor as a training medium 
(36, pp. 22-33). When regarded simply as an instrument for conveying in- 
formation, the instructor has both advantages and disadvantages in comparison 
with mechanized media. In the film, all students can be exposed to carefully 
planned teaching by the best available instructors. Along with oral communica- 
tion, the film can employ visual demonstrations that might otherwise be im- 
practicable or even impossible in a classroom situation. Chief among the 
special advantages of a live instructor, on the other hand, is his opportunity to 
adjust his presentation to the characteristics of different audiences and to 
respond to continuous feedback from his listeners. If a film fails to put a point 
across, it keeps right on going. But if a lecturer is faced by puzzled frowns, 
blank stares, or signs of restlessness, he can try again or change his approach. 

Suggestive findings on the influence of audience feedback upon the effec- 
tiveness of communication were provided by an experiment in which students 
drew complex geometric patterns that were described orally by the experimenter 
but were not seen by the class (83). Several feedback conditions were in- 
vestigated, ranging from zero feedback, with the experimenter behind a screen 
and no questions or noise permitted, to free feedback, with the audience in 
view and questions permitted. Results showed that, with more feedback, ac- 
curacy of communication increased, confidence was higher on the part of both 
experimenter and students, and less hostility was manifested by the students. 

The instructor may also function as a discussion leader (36, pp. 28-31). In 
this connection, some of the exponents of client-centered counseling and psy- 
chotherapy have been exploring the possibilities of student-centered teaching 
(see 36, pp. 28-31; 88; 115, ch. 9; 116, chs. 13-15). In applying these tech- 
niques to teaching, one should note certain essential differences between teach- 
ing and psychotherapy. Rogers (115) points out, for example, that, while the 
therapist responds only by clarifying feelings and attitudes, the teacher must 
respond to both intellectual content and emotional attitudes expressed by the 
group. The teacher should acquaint his students with sources of information, 
opportunities for practicing skills, and other relevant resources with which they 
can implement their objectives. He himself also serves “as a flexible resource to 
be utilized by the group in the ways which seem most meaningful to them”
(115, p. 402).

Student-centered teaching may be quite effective for attaining certain educa- 
tional objectives and for teaching certain types of subject matter. For other 
purposes, however, different procedures are likely to be more satisfactory. The
relative success of student-centered teaching probably varies, too, with the
educational level and other qualifications of the students and with the per- 
sonality characteristics of the instructor. An interesting illustration of such 
interaction effects was provided by an experiment in which students were 
randomly assigned to four instructional procedures varying in degree of student 
contacts with other students and with an instructor (10). A significant inter- 
action was found between instructional procedure and sociability (or social 
introversion-extroversion) as measured by a standardized personality inventory.
Thus, in the group taught by the lecture method, the less sociable students 
made significantly greater achievement gains than did the more sociable; among 
those assigned to small autonomous discussion groups with no instructor, on the 
other hand, the more sociable students made significantly greater achievement 
gains than did the less sociable. 

In this discussion of instructional media, no mention has been made of 
recent applications of teaching machines and programmed instruction to edu- 
cation. These developments are of such magnitude and potential significance 
as to justify their separate treatment in the following section. 


TEACHING MACHINES AND OTHER 
TECHNOLOGICAL DEVELOPMENTS 


Teaching Machines. In Chapter 5, teaching machines were briefly dis- 
cussed, along with training devices and simulators, with special reference to 
their uses in industrial and military training. It will be recalled that teaching 
machines are essentially instructional devices that afford an opportunity for 
active learner participation, provide immediate feedback or knowledge of 
results, and permit each individual to progress at his own pace. As early as
1924, Pressey developed a teaching machine that was an adaptation of a self- 
scoring test (87, pp. 35-41; 94; 104). The student answered multiple-choice 
questions by pressing a key on the machine. If his answer was correct, the 
machine automatically exposed the next item; if it was incorrect, the machine 
did not advance to the next item until the correct key was pressed. 
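
The advance rule of such a device is simple enough to state as a short program. What follows is a minimal sketch of the logic as described above, not a model of Pressey's actual mechanism. The class name PresseyMachine, the use of letters for keys, and the running error count are assumptions added for illustration.

```python
class PresseyMachine:
    """Multiple-choice drill device that advances only on a correct key press."""

    def __init__(self, answer_key):
        self.answer_key = list(answer_key)  # correct key for each item, in order
        self.position = 0                   # index of the item now exposed
        self.errors = 0                     # assumed feature: tally of wrong presses

    @property
    def finished(self):
        return self.position >= len(self.answer_key)

    def press(self, key):
        """Register one key press; return True if the machine advances."""
        if self.finished:
            raise RuntimeError("no items remain")
        if key == self.answer_key[self.position]:
            self.position += 1  # correct: expose the next item
            return True
        self.errors += 1        # incorrect: the same item stays exposed
        return False


if __name__ == "__main__":
    machine = PresseyMachine(["B", "D"])
    for key in ["A", "B", "D"]:
        advanced = machine.press(key)
        print(key, "advances" if advanced else "stays on the same item")
    print("errors:", machine.errors)
```

The learner receives knowledge of results after every press and cannot leave an item until the correct key has been found. These are the two features of the original device that the text emphasizes.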

Over the next thirty years, Pressey and his students developed other forms 
of this device and gathered data indicating its effectiveness in accelerating 
classroom learning (36, 87, 105). This type of teaching machine, which 
Pressey still advocates, differs from the more common current types in that it 
represents an adjunct to traditional instruction, or a training aid, rather than 
a method for accomplishing the complete instructional task (106). Such a 
teaching machine presupposes that some initial learning has already occurred 
through textbook reading, lectures, laboratory work, field trips, or other in- 
structional procedures. 

Pressey's original teaching machines had little effect upon educational 
practices, probably because they were introduced at a time when neither edu- 
cation nor society at large was ready for such innovations. It was not until 
about 1958 that the current teaching machine movement got under way; but 
since that time it has proceeded at a frenetic pace. Several factors contributed 
to this development. Learning theory had made many advances over the pre- 
ceding quarter century and at least some learning theorists were beginning to 
turn their attention to the educational implications of their research. An im-
portant milestone in this connection was Skinner's 1954 article on "The Science
of Learning and the Art of Teaching" (123). Another influence stemmed from 
the training technology of World War II, with its automatic training devices 
developed in the urgency of military needs. Still another contributing factor 
was the shortage of qualified teachers, combined with various pressures upon 
educational systems to augment their instructional facilities. 

The development of teaching machines and their educational implications 
have been surveyed in a number of books (e.g., 31, 40, 67, 125) and many 
articles (e.g., 35, 94, 128). A good introduction to the learning theory underly-
ing these technological developments is given by Green (50). The most com-
prehensive source is the volume assembled by Lumsdaine and Glaser (87) for
the Department of Audio-Visual Instruction of the National Educational 
Association. While most educational psychologists feel that teaching machines
will probably make a valuable and lasting contribution to education, they have 
become concerned about the overemphasis on “hardware” and gadgetry that 
characterizes the present movement. Some commercial distributors, moreover, 
have been promoting the sale of teaching machines too vigorously and too 
soon. 

In the effort to help potential users to evaluate the bewildering array of 
available self-instructional materials, psychologists and educators have tried to 
provide some guidelines. An example is an interim statement about teaching 
machines issued jointly in 1961 by the American Psychological Association, 
the American Educational Research Association, and the Department of Audio- 
Visual Instruction of the National Educational Association (120). As ob- 
served in this statement, it is not the “machine” that teaches, but rather the 
instructional content or program of materials presented through the machine. 
Such a program needs to be evaluated from the standpoint of both curriculum 
development and learning technology. Of interest in this connection is a 
project under way at Educational Testing Service to develop practical, quality 
control standards for programmed learning (129). Also noteworthy are the 
organized efforts to induce a wider participation by teachers in the construction 
and testing of teaching machine programs (80).⁸ 

Programmed Learning. “Programming” of auto-instructional materials 
refers to the preparation of the content of a teaching program. Particular atten- 
tion is given to the formulation of specific steps into which the subject matter 
is analyzed and the sequence in which these steps are presented. There are two 
major programming techniques, identified with Skinner (87, part III; 124) and 
Crowder (27, 28), respectively. Both types are designed to provide self- 
sufficient instructional programs, in contrast to the approach represented by 
Pressey. 

Skinner’s programming technique is a direct outgrowth of his learning 
theory and his basic laboratory research with animals. Known as linear pro- 
gramming, this technique employs the same sequence of steps for all indi- 
viduals. The material to be learned is presented in small, easy steps, formulated 
so as to reduce the possibility of response errors to a minimum. For each item, 
the learner “constructs” his response either by writing or by moving sliders 
that cause the desired letters or numbers to appear. Knowledge of results 
follows immediately upon completion of each response. Figures 116 and 117 


8 See also publications of The Center for Programed Instruction, Inc., New York. 

Fig. 116 College student using a Skinner-type teaching machine. (Courtesy 
of B. F. Skinner.) 


show a college student and a schoolchild using machines which require these 
two response modes. 

The object of the Skinnerian technique is to elicit or “shape” the correct 
response by prompting or response cueing. The needed information is con- 
veyed by the items themselves through a wide variety of ingenious prompts, a 
few of which are illustrated in Table 23. By easy steps that virtually assure 
success and hence positive reinforcement of responses, the individual pro- 
gresses from items providing maximal prompting to items providing minimal 
prompting. This process of progressive cue reduction is called “vanishing” and 
is basic to Skinner's method of shaping responses. 
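
The fixed sequence and immediate knowledge of results described above can be made concrete with a short sketch. The frame wording paraphrases the spelling frames of Table 23; the names `Frame`, `FRAMES`, and `run_linear`, and the rule for scoring a response, are illustrative assumptions and not part of Skinner's published procedure.

```python
from dataclasses import dataclass


@dataclass
class Frame:
    prompt: str  # text shown to the learner, including whatever cues remain
    answer: str  # the response the learner is expected to construct


# Hypothetical encoding of the six spelling frames. Early frames supply most
# of the word; the final frame supplies none of it ("vanishing").
FRAMES = [
    Frame("Copy the word: manufacture", "manufacture"),
    Frame("Like part of 'factory': manu____ure", "fact"),
    Frame("Like part of 'manual': ____facture", "manu"),
    Frame("Same letter in both spaces: m_nuf_cture", "a"),
    Frame("Same letter in both spaces: man_fact_re", "u"),
    Frame("Chair factories ___________ chairs.", "manufacture"),
]


def run_linear(frames, respond):
    """Present every frame in the same order to every learner.

    `respond` is any callable that takes the prompt and returns the learner's
    constructed response. The result for each frame is recorded at once,
    standing in for immediate knowledge of results.
    """
    results = []
    for frame in frames:
        response = respond(frame.prompt)
        results.append(response.strip().lower() == frame.answer)
    return results
```

Because the sequence never depends on the learner's responses, two learners differ only in pace and in the recorded results, not in the route taken through the program.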

In contrast to Skinner's linear programming, Crowder employs a branching 
technique. This type of program characteristically utilizes multiple-choice 

Fig. 117  Schoolchild using Skinner-type arithmetic teaching machine. 
(Courtesy of B. F. Skinner.) 


TABLE 23 A Set of Frames Employed in Skinner Programming to 
Teach a Third- or Fourth-grade Pupil to Spell a Word 
(From Skinner, 124, p. 972) 


1. Manufacture means to make or build. Chair factories manufacture chairs. Copy 
the word here: 

_ _ _ _ _ _ _ _ _ _ _ 

2. Part of the word is like part of the word factory. Both parts come from an old 
word meaning make or build. 

manu _ _ _ _ ure 

3. Part of the word is like part of the word manual. Both parts come from an old 
word for hand. Many things used to be made by hand. 

_ _ _ _ facture 

4. The same letter goes in both spaces: 

m _ nuf _ cture 

5. The same letter goes in both spaces: 

man _ fact _ re 

6. Chair factories _ _ _ _ _ _ _ _ _ _ _ chairs. 


rather than constructed responses. Depending upon the response option chosen 
by the individual, he is channeled through a different route. If he chooses the 
correct response, the machine moves on to the next item. A wrong response, 
however, is followed by explanations and review designed to correct the error 
specific to that response. Wrong responses thus route the individual through 
remedial branches or detours. Since all these alternative routes are built into 
the program, the Crowder technique is also described as "intrinsic program- 
ming." Another feature of the Crowder technique is that new information is 
presented by conventional methods of communication. Each item exposed on 
the viewing screen of the machine typically contains an informational passage 
followed by multiple-choice questions on that unit. The steps in this type of 
program are also larger than in the Skinner type. Rather than trying to 
eliminate errors, Crowder programming utilizes errors to clarify communication 
through remedial loops. An example of a teaching machine designed for use 
with Crowder programming is reproduced in Figure 118. 
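
The routing logic of a branching program can be sketched as a small lookup table in which every answer option names the item to be shown next. The arithmetic content, the item identifiers, and the function `run_branching` are hypothetical illustrations and are not taken from any actual Crowder program.

```python
# Each item holds an informational passage with its question, plus the
# multiple-choice options. Every option maps to (feedback, next item id);
# a next id of None ends the program. All content here is made up.
ITEMS = {
    "q1": {
        "text": "Multiplication is repeated addition. What is 3 x 4?",
        "options": {
            "a": ("Correct.", "q2"),
            "b": ("You added 3 and 4 instead of multiplying.", "r1"),
        },
    },
    "r1": {
        "text": "3 x 4 means 3 + 3 + 3 + 3. What is that sum?",
        "options": {
            "a": ("Correct. Now try the original question again.", "q1"),
            "b": ("Count the four threes once more.", "r1"),
        },
    },
    "q2": {
        "text": "What is 2 x 5?",
        "options": {
            "a": ("Correct. End of unit.", None),
            "b": ("2 x 5 means 2 + 2 + 2 + 2 + 2. Try again.", "q2"),
        },
    },
}


def run_branching(items, start, choose, max_steps=50):
    """Follow the route determined by the learner's choices.

    `choose` is any callable taking (item id, item) and returning an option
    key. Returns the list of (item id, choice) pairs actually visited.
    """
    path = []
    current = start
    while current is not None and len(path) < max_steps:
        item = items[current]
        choice = choose(current, item)
        _feedback, next_id = item["options"][choice]
        path.append((current, choice))
        current = next_id
    return path
```

A learner who answers every item correctly passes straight through `q1` and `q2`, whereas one wrong choice on `q1` adds the detour through `r1`. Since all such routes are written into `ITEMS` in advance, the branching is intrinsic to the program in the sense described above.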

Although most teaching machine programs follow either the Skinner or the 
Crowder approach, individual features of the two methods may be recombined 


Fig. 118 A teaching machine designed for use with Crowder programming. 
(AutoTutor Mark II, reproduced by courtesy of U.S. Industries, Educational 
Science Division.) 

in different ways and a number of minor variations have been worked out.* 
Each method has certain advantages and disadvantages, and their relative 
merits have been widely debated. Because of the multiplicity of educational 
objectives and the diversity of subject matter to be taught, it appears likely 
that different methods will be suitable for different purposes (15). This is 
true of Pressey’s adjunct programming, as well as of the linear and branching 
types of programming described above. Moreover, auto-instructional techniques 
should be considered together with other, more conventional instructional 
procedures in designing effective educational systems. 

Teaching machines undoubtedly have much to contribute. They save teacher 
time that would be spent on such routine tasks as drill, grading papers, and 
record keeping. But even more important is the fact that teaching machines 
can accomplish certain teaching functions more efficiently than conventional 
teaching procedures. They can, for example, provide individualized tutorial 
instruction to a whole class simultaneously. The results that can be achieved 
with such improved procedures have barely been glimpsed. For instance, logic 
and other “advanced” subjects have been successfully taught to elementary 
schoolchildren; and three-year-olds have been taught to read and write (see 
50, p. 197). These findings challenge the traditional educational concept of 
readiness. 

Teaching machines also lend themselves to the teaching of new develop- 
ments in science and other rapidly expanding fields, for which it would be 
difficult to find enough adequately trained teachers (80). From a different 
angle, teaching machines permit a more complete control of conditions for 
applied research on school learning. With such automated devices, one factor 
at a time can be isolated and varied systematically. The effect of individual 
differences among teachers, which complicates the interpretation of most class- 
room experiments, can thus be ruled out. 

Of particular interest, too, is the possibility of utilizing teaching machines 
with delinquent, mentally retarded, behavior-problem, and other types of 
children who may have developed negative attitudes toward teachers and other 
authority figures. The impersonal nature of the teaching machine and the high 
proportion of successful responses serve to counteract these earlier attitudes 
(15, 89, 90). Because of the saving in teaching personnel and hence in total 
instructional cost, teaching machines are also useful in prisons and other insti- 
tutional settings. An exploratory study of the application of teaching machines 
is under way at the Draper Correctional Center in Elmore, Alabama (89, 90). 
Still another application of teaching machines is to be found in the educational 
programs of underdeveloped countries, particularly for literacy training and 
for elementary courses in science and mathematics. Teaching machines have 
already been introduced on an experimental basis in several African nations." 

The art of programming itself, ushered in by teaching machines, is begin- 
ning to influence other teaching media. Programmed textbooks on many sub- 
jects have been prepared in accordance with both Skinnerian and Crowder 
principles (e.g., 28, 64). Conventional classroom lectures have been made more 


A comprehensive survey of auto-instructional devices and the types of programming 
for which they are suitable is given by Kopstein and Shillestad (81). The United 
States Office of Education publishes a guide to available instructional programs for use 
in schools. 


* See, e.g., the October, 1962, issue of Auto-Instructional Devices. 

effective by giving some attention to the programming of their content. In fact, 
one investigation found no significant differences in student achievement as a 
result of using teaching machines, programmed texts, or programmed lectures, 
although all three of these methods proved superior to ordinary lectures (114). 
Programming may also be useful as a teacher training experience, in intro- 
ducing the teacher to concrete applications of learning principles to the sort of 
material she plans to teach (80). Teaching machines may succeed in breaking 
down the wall that has for so long separated the learning laboratory from the 
classroom. If they accomplish no more than this, the effort that has gone into 
their development will have been well spent. 

Systems Development. The concepts of man-machine systems utilized in 
engineering psychology (see Chapter 9) are also finding applications in educa- 
tion. In the terminology of engineering psychology, teaching machines are 
“closed-loop” systems, since they provide continuous feedback. By contrast, 
films, television, and formal lectures are “open-loop” systems, in which informa- 
tion flows in only one direction. By means of computers, many aspects of 
educational systems can be simulated and hence investigated under controlled 
conditions. Computers, along with computer-based teaching machines, can be 
utilized in every major activity of educational systems, including instruction, 
student guidance, and administration. 

In several centers, exploratory research is in progress on the use of computers 
in automated instructional systems (24; 81, pp. 81-90). An example is the on- 
going IBM project on computer-controlled teaching machines (24, pp. 171- 
190; 108; 144). A digital computer can simulate any type of teaching machine. 
Owing to its flexibility, extensive storage capacity or “memory,” and rapid 
decision-making power, it can also come close to simulating the "conversational 
interaction" between a human tutor and an individual student. In the IBM 
project, programs have been prepared and tried out for teaching such varied 
subjects as stenotypy, psychological statistics, and German reading. Information 
is initially presented by means of a textbook, although individual reading 
assignments are given out by the computer. Using an electric typewriter, the 
student constructs his responses (as in Skinner programming), but the in- 
structional sequence follows a branching pattern (as in Crowder program- 
ming). However, the branching is not predetermined or intrinsic to the pro- 
gram but is tailored to the individual student. On the basis of each student's 
stored response history, the computer chooses individual items and assembles 
exercises appropriate to the student's own past performance. 
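
How an exercise might be assembled from a stored response history can be illustrated with a minimal sketch. The selection rule used here (give priority to the items with the lowest proportion of correct past responses, treating unseen items as weakest) is an assumption made for illustration; the sources cited do not specify the rule used in the IBM project, and the name `assemble_exercise` is hypothetical.

```python
def assemble_exercise(history, item_pool, size=3):
    """Pick the `size` items on which the student has done worst so far.

    `history` maps an item name to the list of that student's past results
    (True for a correct response). Items never attempted count as 0.0, so
    new material is introduced alongside review of weak material.
    """
    def success_rate(item):
        attempts = history.get(item, [])
        return sum(attempts) / len(attempts) if attempts else 0.0

    # sorted() is stable, so items with equal rates keep their pool order.
    return sorted(item_pool, key=success_rate)[:size]


# Usage with made-up statistics topics for one student:
history = {
    "mean": [True, True],
    "median": [False, True],
    "variance": [False, False],
}
pool = ["mean", "median", "variance", "correlation"]
next_exercise = assemble_exercise(history, pool, size=2)
```

Two students working from the same item pool thus receive different exercises, which is the sense in which the branching is tailored to the individual and is not predetermined.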

In a more comprehensive approach, psychologists at System Development 
Corporation (17; 20; 24, pp. 191-204; 122) have been experimenting with a 
simulated school conducted in a specially built Computer-based Laboratory for 
Automated School Systems (CLASS). This laboratory contains classroom 
facilities for 20 students, a guidance center, and a principal's office. Three 
aspects of the educational process are being investigated in an integrated 
system: (1) individual teaching machines with highly flexible, computer- 
controlled programming; (2) group instruction by means of films, television, 
and conventional teaching procedures; (3) centralized data processing for 
administrative, student-guidance, and planning functions. 

Some attention is also being given to other potential uses of computers in 
educational systems (24, pp. 3-12; 25; 30; 141). One example is the develop- 
ment of information-retrieval systems utilizing computer-based libraries at 
regional or national centers. Through such a system, students, instructors, and 
research workers could obtain information on the current state of knowledge 
in any area of the sciences or arts, without an individual searching process. 
Information now scattered in several geographically remote libraries could thus 
be made available simultaneously to an individual in any given location. 

Another illustration is provided by the utilization of computers in making 
administrative decisions regarding students, staff personnel, equipment, and 
other educational matters. Computers are already in use in many school dis- 
tricts to process administrative data such as attendance records, registration 
statistics, payroll, budgetary appropriations, and the like. Their application can 
be further extended to provide the required information for allocating and 
scheduling rooms, equipment, and other educational resources for more effective 
use; for helping students as well as colleges to make decisions regarding college 
admissions; for placement of students in courses suited to their qualifications; 
and for diagnosing learning difficulties requiring remedial instruction. A 
related application is in training teachers and educational administrators in 
decision-making functions. For this purpose, operational gaming procedures 
could be devised similar to the management games discussed in Chapter 5. 
Within a period of months, such computer-controlled games could provide the 
trainee with the equivalent of several years of experience in decision making 
within the appropriate context. 


SUMMARY 


Chief among the fields in which psychologists work in a consulting or 
collaborative relation with other professional personnel are education, medicine, 
and law. Within the field of education, the roles of psychologists may be 
differentiated into those of school psychologist and educational psychologist. 
Although the functions of school psychologists vary widely from school to 
school, they consist largely in the diagnostic testing and interviewing of indi- 
vidual pupils, followed by recommendations for remedial action. Working 
through teachers and other educational personnel, the school psychologist 
utilizes the school environment as a therapeutic medium. 

Educational psychologists are concerned principally with teacher training 
and educational research. Texts and courses on educational psychology focus 
upon the learning process, although there is still a wide gap between the psy- 
chology of learning and the technology of teaching. It is recognized that the 
preparation of teachers should also include some familiarity with other areas 
of psychology, notably developmental psychology, differential psychology, psy- 
chological testing, the psychology of adjustment, and social psychology. 

The term educational research has traditionally included many activities 
that fall outside the domain of research as defined in science. With a few 
notable exceptions, research in educational psychology made little progress 
until midcentury. At that time, several developments in education, in psychol- 
ogy, and in society at large contributed to an upsurge of psychological research 
in educational contexts. Even before that time, psychologists had been working 
effectively with educators in test construction and in the analysis of test 
results. Currently, educational psychologists are also conducting research in 
such areas as those of ideational problem solving, creativity, teacher character- 
istics, and teaching media. 

Teaching machines and programmed instruction represent one application 
of the psychology of learning to teaching methods. Teaching machines are 
automatic devices that provide for active learner participation, immediate feed- 
back, and individual pacing. The instructional content presented by the teach- 
ing machine is known as a program. There are two major programming 
techniques, namely, linear and branching programming. Exploratory research 
is under way on the use of computers in the operation of teaching machines, 
as well as on broader applications of computers to educational systems. 


The role of emotional, motivational, and interpersonal factors in medical 
practice has long been recognized. In fact, such recognition antedates modern 
scientific medicine, being of primary significance in the ministrations of the 
medicine men of early preliterate cultures. In more recent times, physicians 
have generally been aware of the importance of many “psychological” factors 
in the daily practice of medicine. Examples include the contribution of doctor- 
patient relations and of the doctor's “bedside manner" to the patient's recovery; 
the need for considering personality, familial, and cultural factors in diagnosing 
a patient's condition; and the dependence of physical improvement upon the 
individual patient's cooperation and his motivation to recover. 

Despite the many potential applications of psychological knowledge to 
medicine, however, contacts between the two fields were negligible until quite 
recently. Even books on the psychology of medical practice or the psychology 
of physical illness were often written, not by psychologists, but by psychiatrists 
(see, e.g., 9, 44). This was also true of courses or units on psychology taught 
in medical schools. With their strongly pathological orientation, psychiatrists 
are likely to provide a distorted picture of the psychologically normal patients 
encountered in other branches of medicine. Moreover, the training and experi- 
ence of psychiatrists emphasize clinical practice rather than basic behavioral 
science. Consequently, their notions of human behavior tend to derive from 
uncontrolled personal observation and selected cases rather than from systematic 
and controlled experimentation. 

It was not until after World War II that professional contacts between 
psychology and medicine attained significant proportions, and even now the 
potential contributions of psychology in this field are largely undeveloped. 
Originally, the contacts occurred almost entirely between clinical psychologists 
and psychiatrists and were limited to psychodiagnostic functions. Later, an in- 
creasing number of referrals for psychodiagnosis started to come from other 
medical specialists, such as pediatricians, internists, neurosurgeons, plastic 
surgeons, obstetricians, and endocrinologists (63). This trend reflected a grow- 
ing realization among medical practitioners that a standardized assessment of 
the patient’s emotional responses, intellectual functioning, and other psycho- 
logical characteristics would in many cases facilitate the diagnosis of physical 
difficulties. After midcentury, the role of psychologists in medical practice 
expanded to include consulting relations in other than diagnostic functions. The 
growth of collaborative psychological research in medical settings and of 
medical school teaching by psychologists was also characteristic of this period. 
Finally, contacts with medicine were no longer limited to clinical psychologists 
but came to include counseling, experimental, physiological, developmental, 
social, and practically all other kinds of psychologists. 


PSYCHOLOGY IN MEDICAL PRACTICE 


In every major phase of medical practice, psychology can contribute in two 
principal ways: (1) by providing knowledge and orientations that improve the 
physician's performance of his own functions; (2) by providing the services 
of psychologists as consultants or as participating members of interprofessional 
teams. These two avenues of contact between psychology and medicine are not 
entirely distinct. Psychological orientations, for example, may be acquired by 
the physician, not only during medical school training, but also in the course 
of consulting and collaborative relationships with psychologists. In the present 
section, we shall illustrate the types of contributions psychology can make to 
diagnosis, treatment, rehabilitation, and preventive medicine. 

Diagnosis. Mention has already been made of the increasing use of psycho- 
diagnostic referrals in nearly every branch of medicine. In such cases, the 
physician integrates the psychodiagnostic data with medical data in arriving at 
his diagnosis. In his own diagnostic functions, moreover, every physician relies 
at least in part upon information elicited from the patient through interview- 
ing techniques. Training in effective interviewing procedures is thus as relevant 
to this field as it is to personnel selection (Chapter 4) or to the practice of 
clinical psychology (Chapter 13). Much can be accomplished by simply re- 
cording and playing back typical diagnostic interviews, thereby alerting the 
physician to common errors (see 49, chs. 1 and 10). Some familiarity with the 
findings of psychological research on interviewing procedures is also helpful 
(see Chapter 4). 

For a variety of reasons, the medical interview presents special difficulties 
for effective communication and hence requires a particularly skillful inter- 
viewer. For the patient, the situation is emotionally loaded and is likely to 
arouse ambivalent attitudes. On the one hand, the patient wants to communi- 
cate fully in order to obtain maximum help from the physician. On the other 
hand, he is reluctant to face the possibility that he may require long or un- 
pleasant therapy or that he may be suffering from a serious disease. The com- 
bination of these conflicting tendencies may distort the patient's report in a 
number of ways. The patient's emotional involvement also heightens his sug- 
gestibility. He will thus be quick to respond to slight cues in the physician's 
queries that may suggest symptoms he had not really observed. He is also likely 
to read unintended meanings into the physician's remarks, tone of voice, and 
other expressive reactions. Communication is further obscured by differences 
in language between physician and patient. Such differences may arise, not 
only from medical terminology and from the different connotations of common 
words to layman and physician, but also from possible differences in socio- 
economic level, general education, and cultural background. 

Mention should also be made of the extent to which medical diagnosis 
utilizes introspective report regarding the intensity, qualitative characteristics, 
and localization of pain, as well as other bodily sensations. The extensive 
psychological research on sensory and perceptual responses and on the factors 
influencing their accuracy is pertinent to an evaluation of such reports. A 
more specific example is provided by the detailed visual examinations con- 
ducted in ophthalmology. The standard psychophysical procedures of the psy- 
chological laboratory have much to contribute to these examinations. 

Treatment. There has been a growing interest in the nature of the inter- 
personal relation between physician and patient and in the part which this 
relation plays in the recovery process (see, e.g., 44, 54, 67). Theoretical 
analyses of this relation have been proposed in terms of the attitudes of both 
patient and physician toward each other. The patient's attitudes, for example, 
may vary from passive dependence or awed admiration to skepticism or active 
resistance. The physician, on his part, may exhibit varying degrees of empathy 
and warmth or impersonal objectivity and aloofness. The physician-patient 
relationship may also be characterized in terms of a continuum ranging from 
authoritarian control through guidance, to mutual participation. In all its 
aspects, the relationship varies as a function of the nature and severity of the 
patient's illness, the personality and intellectual characteristics of both patient 
and physician, the physician's reputation, the patient's earlier contacts with 
physicians, and many other circumstances. 

Although empirical data on the effects of the physician-patient relationship 
are meager, it seems evident that the nature of this relationship will be reflected 
in the patient's cooperation in carrying out medical orders. That the relation- 
ship itself may have therapeutic value is also widely recognized. Contact with 
a physician in whom the patient has confidence frequently provides an emo- 
tional "lift" accompanied by a noticeable amelioration of symptoms. In many 
cases, this effect may contribute to the patient's ultimate recovery. 

From another angle, familiarity with a patient's cultural background may 
prove very helpful to the physician (3, 77). Behavior that might appear bizarre 
and irrational becomes intelligible when viewed against the frame of reference 
of the patient's own culture. Deep-rooted cultural beliefs and traditions may 
profoundly affect attitudes toward illness, doctors, hospitals, and other medical 
matters. A doctor who understands the patient's culture enjoys an advantage, 
not only in arriving at a correct diagnosis, but also in presenting required 
therapeutic procedures in ways that are acceptable to the patient. 

The application of psychology to medical practice has been furthered by the 
recent movement toward comprehensive medicine (35, 39, 64). Arising as a 
reaction against the increasing specialization of medical practice during the 
first half of the twentieth century, comprehensive medicine focuses on the 
treatment of the whole person. With the gradual disappearance of the family 
doctor, medicine was losing sight of the patient. Comprehensive medicine is 
an effort to reverse this trend. Illness is viewed, not as a specific disorder of an 
isolated organ, but as a complex relation between the organism and his environ- 
ment (39). The environmental stresses to which the organism is subjected may 
range from bacterial infection to interpersonal relations and cultural pressures. 
The resulting physiological and behavioral changes depend partly upon the 
nature of these environmental stresses and partly upon the state of the indi- 
vidual at the time. This state in turn reflects the individual's heredity and the 
residual effects of his previous life experiences. 

In the effort to train medical students in the practice of comprehensive 
medicine, medical schools have been experimenting with a number of specially 
designed instructional procedures, such as joint teaching by different depart- 
ments on hospital wards, continuing case studies of individual patients, and 
family care programs (42, 62). In the family care programs, each medical 
student is assigned a particular family whose members he treats in their natural 
environment during his medical training. This procedure permits a continuing 
relationship between the medical student and individual patients, as well as 
providing him with relevant information about the family setting and home 
background of the patients. 

The participation of psychologists themselves in medical treatment is ex- 
emplified by the appointment of psychologists in hospitals for orthopedic dis- 
abilities, tuberculosis, and other physical disorders. There is a growing recogni- 
tion that in these settings the psychologist can function most effectively as a 
consultant to ward physicians, nurses, and other staff members (14; 40, ch. 24). 
Rather than spending a few hours a week with a handful of patients, the psy- 
chologist can thus help the hospital staff to deal more effectively with the 
daily problems arising in patient care. This staff-oriented approach is in line 
with the milieu therapy of psychiatric patients described in Chapter 14. It is 
also similar to the role of the school psychologist insofar as the latter works 
through teachers and other school personnel who are in daily contact with the 
child (Chapter 18). 

Rehabilitation. The rehabilitation movement originated after World War I 
in an effort to facilitate the vocational readjustment of disabled war veterans. 
It was not until after World War II, however, that the movement underwent 
a significant expansion in scope and functions (30, 31, 104). At that time, its 
objectives were broadened to include not only vocational training and place- 
ment but also personal, familial, and community adjustment. The categories of 
patients reached by rehabilitation services were also increased to cover all types 
of injuries and illnesses, including psychiatric disorders. For any patient, it was 
recognized that convalescent care should serve to prepare the individual for a 
return to community living. Depending upon the nature of the disorder, the 
duration of the illness, and the severity of residual handicap, the rehabilitation 
program may vary from a brief counseling session to a long period of extensive 
retraining. 

Today rehabilitation is typically carried on by teams of professional workers 
drawn from many fields, including physicians, nurses, physical therapists, oc- 
cupational therapists, speech therapists, psychologists, social workers, voca- 
tional counselors, prosthetic specialists, and others. The focus is on the total ad- 
justment of the individual to his environment, rather than on the disability 
as such. Moreover, it is generally recognized that the individual’s reaction to 
his disability is often more important than the disability itself in determining 
his work capacity and general adjustment. The attitudes of family, employers, 
teachers, and other significant associates are also major factors in the patient's 
progress. 

The psychologist’s role on the rehabilitation team includes not only assess- 
ment and counseling of the patient himself but also consultation with other 
team members. Accurate assessment of motor and intellectual skills, emotional 
traits, motivation, and attitudes is of fundamental importance in planning the 
rehabilitation program, in determining the effectiveness of remedial procedures, 
and in detecting progressive changes in the patient. In severely disabled per- 
sons, even slight increments or decrements in function may drastically alter 
capacity for self-care, for job performance, and for other practical activities. 
Consequently, precise and repeated assessments of abilities are needed. On 
the basis of his continuing evaluation of the patient through testing and inter- 
viewing, the psychologist can help in the planning and revision of the rehabili- 
tation program at all stages. In his consultative functions, he can also enhance 
the psychological orientation of the other team members, alerting them to the 
psychological aspects of the patient's adjustment to disability. 

Preventive Medicine. In both individual medical practice and public 
health programs, preventive medicine needs the cooperation of the lay public. 
The availability of preventive medical techniques such as the use of polio- 
vaccine can be of no help if the public does not take advantage of them. Early 
detection and treatment of such conditions as cancer and tuberculosis likewise 
depend upon the knowledge and attitudes of the individual patients. To be 
effective, preventive medicine requires a well-informed public with appropriate 
attitudes toward disease prevention. 

Public information campaigns in the field of preventive medicine can utilize 
the methodology and research findings of consumer psychology (see Chapter 
10). In the first place, data are needed about existing public attitudes or mis- 
information that may interfere with the adoption of any given preventive 
measure. Superstitions, cultural traditions, and deep-rooted emotional reactions 
that would cause resistance must be identified. For example, efforts have been 
made to investigate such questions as why people fail to seek poliomyelitis 
vaccination, why they oppose community water fluoridation, and why they 
do not report for treatment following a cancer diagnosis (46, 50, 71, 76). 

In one survey, the investigator contacted the families of 227 patients who 
had received a diagnosis of cancer and who later died of cancer without hav- 
ing reported for treatment (see 46, p. 430). In one-third of these cases, the 
reasons given for failure to return for treatment were fear and unwillingness 
to admit the diagnosis. Such findings illustrate an all too frequent reaction to 
certain illnesses that have acquired an aura of mystery and dread. "Fear of 
knowing" undoubtedly keeps many persons from even seeking a diagnosis. In 
a more recent survey (55), 808 New York City residents were interviewed 
regarding their medical histories, past health behavior, and current health atti- 
tudes. Results showed that delays in seeking medical care occurred signifi- 
cantly more often for cancer symptoms than for other medical symptoms. 
Delay in seeking care for cancer was also found to be associated with low 
educational and socioeconomic levels. 

Knowledge about prevailing public attitudes is a necessary first step in 
planning a public information campaign. The next step is to choose effective 
means of communication (47, 81). "Media research" is as relevant here as in 
the advertising of consumer goods. One survey, for example, was concerned 
with the sources of public information about poliovaccine (47). Both socio- 
economic and urban-rural differences were found in the extent to which indi- 
viduals received their information from physicians, newspapers, radio, and 
other media. Another investigator compared the effectiveness of different types 
of illustrated lectures on dental hygiene (28). He found that a straight factual 
presentation was more successful in improving the oral-hygiene practices of 
high school students than were the more emotional, fear-arousing lectures. 
Although a few psychologists are currently working on problems such as 
these, pertinent research is still very meager. We certainly know much more 
about people's attitudes toward political issues and canned soups than about 
their attitudes toward preventive medicine. 


PSYCHOLOGY IN MEDICAL EDUCATION 


Current interest in the role of psychologists in medical schools is evidenced by 
the number of conferences, symposia, professional committees, and articles that 
have been concerned with it (17, 34, 35, 45, 51, 52, 58, 59, 63-65, 82, 90, 
98). All this activity has been concentrated into little more than a decade. 
Prior to World War II, psychologists were rarely employed by medical schools. 
What little psychology was included in medical school curricula was taught 
by psychiatrists. With the growth of clinical psychology following World 
War II, psychologists began to appear on medical school faculties on a part- 
time basis, to give one or two lectures in psychiatry courses and to demonstrate 
psychological tests during the students’ clinical years. Later, training in psy- 
chology came to be regarded as the “basic science training” for psychiatry. Still 
later—and more slowly—it is being recognized as one of the basic sciences for 
the training of all medical students, along with physiology, anatomy, bio- 
chemistry, and other biological sciences (59). Concurrently, there has been a 
marked upsurge of research activities by psychologists in medical schools. 

As the area of participation broadened, the number of psychologists in 
medical schools rose sharply. A 1952 survey (65) revealed that 73 per cent of 
the country's medical schools employed one or more psychologists on their 
staff. The total number of psychologists so employed was 255, over half being 
on full-time appointments. Within the next 10 years, this number rose to ap- 
proximately 600, and nearly every 4-year medical school in the country was 
represented (66, p. 234). 

The amount of formal instruction currently provided by psychologists in 
medical schools ranges from a few lectures in a course given jointly by several 
departments to full responsibility for one or more courses. There is a growing 
trend toward establishing separate divisions or departments of "medical psy- 
chology" or "behavioral science" (45, 58, 59, 90, 98). The latter may also in- 
clude some work in medical sociology and cultural anthropology (90). Among 
the specific courses most often taught by psychologists, as reported in a 1955 
survey (64), were psychometric or psychodiagnostic techniques, general in- 
troductory psychology, abnormal psychology, the psychology of personality, and 
the professional role of the psychologist in a medical setting. Among the other 
courses offered in at least a few medical schools were developmental psychol- 
ogy, electroencephalography, psychotherapy, and research methods in clinical 
psychology. Since the time of this survey, there has been an increase in “basic 
science” courses, such as physiological and experimental psychology (59). 

The students taught by psychologists in medical schools include general 
medical students in both their basic science and clinical years, as well as psy- 
chiatric residents. Increasingly, psychologists are participating in the training of 
residents in other specialties, such as pediatrics, neurology, and ophthalmology. 
Psychologists also contribute to the training of students in certain allied pro- 
fessional fields, including nursing, occupational therapy, physical therapy, and 
social work (64). 

Much of the time of psychologists employed in medical schools is spent, not 
in formal course instruction, but in other functions that may indirectly serve 
an instructional purpose (64, 66). Thus psychologists participate in case con- 
ferences, in collaborative diagnostic studies, in psychotherapy, and in joint 
research projects with members of other departments. Under certain circum- 
stances, counseling and even short-term psychotherapy with students may have 
a training function, especially when directed toward the improvement of inter- 
personal relations with patients. 

In dental schools, for example, there is little or no course work in psychology, 
although about half of these schools do employ psychologists, usually on a 
part-time basis (66). In these schools, psychologists perform chiefly testing and 
counseling functions. They may also conduct intensive workshops for practicing 
dentists (e.g., 37, 38). Insofar as these procedures provide increased self- 
knowledge and a better understanding of interpersonal relations with patients, 
they are similar to some of the techniques employed in management develop- 
ment programs (see Chapter 5). They should thus be regarded as an integral 
part of professional training. 

In both formal and informal contacts with students and professional col- 
leagues, the psychologist can also make a distinctive contribution through his 
research orientation. He can transmit relevant research findings, not only to 
students in basic science courses, but also to clinical instructors. Drawing upon 
his knowledge of research methodology in behavioral science, he can instill a 
more critical and inquisitive attitude toward the results of research. The 
prospective medical practitioner needs to develop some skill in the evaluation 
of research findings. If he is to keep abreast of current developments in his 
medical practice, he must be able to assess published reports of new research, 
to detect possible sources of error in the data, and to determine what conclu- 
sions can properly be drawn from the results. Still another way in which a 
research orientation can help the medical practitioner is in patient diagnosis. 
Insofar as the diagnostic process requires the formulation and testing of a 
series of interlocking hypotheses, research training should improve its effective- 
ness. Medical diagnosis thus represents one more example of the potential con- 
tribution of research training to professional practice, as discussed in several 
earlier contexts in this book (see Chapters 1, 4, 13). 


PSYCHOLOGY IN MEDICAL RESEARCH 


Overview. One of the chief contributions that psychologists can make to 
medicine is in the area of research. Both in medical schools and in other medi- 
cal settings, the number of psychologists engaged in interdisciplinary research 
projects is growing rapidly. In this research, psychologists collaborate not only 
with medical practitioners but also with specialists in other basic sciences, such 
as physiology, neurology, biochemistry, and pharmacology. Among the well- 
established areas of such collaborative research are electroencephalography 
(23), motion sickness, and the environmental stresses of space travel (Chap- 
ter 8). The latter two areas illustrate the effective collaboration of psychologists 
with medical personnel and other biological scientists in the military services. 
It is also interesting to note that one of the events leading to research on 
sensory deprivation (Chapter 8) was the observation that polio patients con- 
fined in respirators for long periods of time experienced feelings of strangeness, 
hallucinations, and other cognitive disturbances (88). 

A relatively new but rapidly growing area of collaboration between psy- 
chology and medicine is to be found in psychopharmacology (33, 68, 92). 
Much of the psychological research on drugs pertains to their use in the under- 
standing and treatment of psychiatric disorders (Chapter 15). The total con- 
tribution of psychology to pharmacology, however, covers other medical uses of 
drugs, including relief of pain and specific therapies for physical disorders. It 
is being increasingly recognized, for example, that any drug designed for 
medical use should be tested for “behavioral toxicity,” that is, for any adverse 
side effects it may have upon psychological functioning (92, ch. 7). For this 
purpose, almost every available kind of psychological test has been used, from 
measures of simple sensory and motor functions to tests of complex intellectual 
and personality traits (92). 

In the testing of drugs prior to their release for medical use, psychological 
research techniques are being employed at every stage, from preliminary animal 
research, through experimental studies with human subjects, to clinical tests. 
In animal research, behavioral measures are proving more sensitive than phys- 
iological measures in detecting drug effects, and psychologists are now trying 
to develop a standardized battery for testing the effects of any new drug on 
animals. Skinner boxes and other refined instruments for measuring animal 
behavior are beginning to appear in the pharmacology departments of medical 
schools. Drug companies are also beginning to employ psychologists to conduct 
drug research with animals (see, e.g., 80). 

In both experimental studies on normal human volunteers and clinical field 
tests on patients, psychology has much to offer to methodology. Examples of 
these contributions include the use of psychological tests or rating scales for 
objective and quantitative assessment of behavioral changes, the application of 
appropriate sampling and matching procedures in assigning subjects to experi- 
mental groups, the utilization of the double-blind technique (Chapter 15) to 
control the influence of suggestion on subjects and observers, and the designing 
of experiments that permit effective statistical analyses of results. 

Research on psychopharmacology, as well as in other areas of medical and 
psychological collaboration, is also contributing to basic knowledge about be- 
havioral science. From the standpoint of the research psychologist, collabora- 
tion with medicine opens up new research settings in hospitals, clinics, and 
other agencies and provides access to many types of subjects not heretofore 
available. It also permits the use of techniques such as drug administration 
that require medical collaboration. Apart from their therapeutic values, drugs 
are being used experimentally to produce temporary and limited "biochemical 
lesions" in the nervous system. By interfering with selected biochemical events 
these techniques can shed light on the operation of different parts of the nerv- 
ous system in normal behavior. For this purpose, drugs have several advantages 
over earlier methods of extirpation and other types of surgical intervention. 
Not only is drug administration a simpler and better controlled procedure than 
surgical techniques, but the effects of drugs are also reversible. Because of this 
reversibility and the reduction of risk, some drug research can be safely con- 
ducted with human subjects. The precision with which drug administration 
can be controlled and manipulated, moreover, makes possible a clearer delinea- 
tion of the physiological mechanisms underlying behavioral changes. Psycho- 
pharmacology illustrates the reciprocal contribution of collaborative research 
to both medical practice and the advancement of psychological knowledge. 

Understandably, it is in physiological psychology that the bulk of collabora- 
tive research with medicine is to be found. Currently, the greatest activity is 
occurring in the newly emerging field of psychopharmacology, as the above 
discussion suggests. Nevertheless, research contacts between psychology and 
medicine are not limited to physiological problems. Other areas of psychology 
have developed specialized research methodologies that are applicable to medi- 
cine. Of the many potential applications, three will be described below to 
illustrate the variety of possible contributions from both physiological psychol- 
ogy and other areas. 

Nature and Alleviation of Pain. The subject of pain is obviously of con- 
cern to medical practitioners, while falling clearly in the domain of psycho- 
logical research. Pain and its relief have been investigated by psychologists, 
physiologists, neurologists, and pharmacologists, among others (10). Collabora- 
tive projects cutting across these fields, however, are of relatively recent origin. 
Specific studies have ranged from introspective analyses to the identification of 
sensory receptors and neurological correlates of pain. Their objectives have also 
varied widely, extending from the formulation of a basic theory of pain to the 
evaluation of individual techniques for its elimination or diminution. Among 
the techniques that have been explored for the relief of chronic pain, as in 
cancer patients, are prefrontal lobotomy (see Chapter 14), drugs, hypnosis, 
and placebos (4). 

A noteworthy finding of much of the psychological research on pain is that 
pain-relieving agents may eliminate the discomfort and anxiety associated with 
pain without raising the pain threshold or altering pain sensations themselves. 
Thus the individual can still identify the pain, localize it, and discriminate it 
from other sensations such as touch or temperature, but he no longer finds it 
intolerable or unpleasant (4, 8). Physiological alterations, such as blood pres- 
sure changes, ordinarily associated with the discomfort of pain may also dis- 
appear under these conditions. 

Laboratory studies of experimentally produced pain have yielded inconsistent 
results (4, 43). This is not surprising in view of the many methodological 
problems presented by pain research. The more carefully controlled investiga- 
tions, however, have failed to find any significant change in pain threshold 
following administration of such analgesics as aspirin, codeine, and morphine 
(see, e.g., 25, 27, 91). What differences were obtained could be attributed to 
placebo effects and other extraneous variables. For example, one investigator 
found that phenobarbital produced threshold changes similar to those produced 
by morphine (25). Since the side effects of phenobarbital and morphine in the 
dosages employed were indistinguishable to most of the subjects, phenobarbital 
served as an “active placebo” in this experiment (see Chapter 15). 

Of particular methodological interest was the finding that repeated stimula- 
tion of the same spot on the skin produces a consistent rise in pain threshold 
(25, 91), as illustrated in Figure 119. Such a finding calls into question the 
positive conclusions of earlier drug studies employing repeated stimulation of 
a single spot. It also points up the multiplicity of variables that must be con- 
trolled in this type of research and the complex experimental designs required 
to achieve adequate control. Another relevant consideration is that there may 
be different varieties of pain, which are mediated by different neurological 
mechanisms. Drugs that relieve one type of pain may thus yield negative results 
with another type. 

Fig. 119 Effect of repeated stimulation of same spot on pain threshold for 
radiant heat. (From Sweeney, 91, p. 46.) 

Process of Medical Diagnosis. A very different kind of research is illus- 
trated by exploratory studies of the process of medical diagnosis and by the 
development of techniques for evaluating and training such diagnostic skills. 
Rimoldi and his associates (74, 75) have developed the Test of Diagnostic 
Skills, which focuses on the diagnostic process rather than on the final diag- 
nosis. After being given preliminary information about a case, including ad- 
mission data and chief complaints, the examinee proceeds by requesting 
additional information as needed. Such information, which he obtains by 
choosing the appropriate cards, covers questions he might ask the patient, 
manipulative techniques he might wish to use in his examination of the patient, 
diagnostic tests he might order, and so forth. The particular items requested, 
as well as the order in which they were requested, provide the basis for 
evaluating the individual's diagnostic performance. 

Several types of scores can be found with the Test of Diagnostic Skills. 
Some of these scores are based on the number of questions asked and the 
utility of these questions in terms of the final diagnosis; others pertain to the 
order in which the questions are asked. The utility of a question is determined 
by the frequency with which it was asked by a criterion group, such as 
practicing physicians or medical school faculty. The sequence of questions is 
analyzed by newly developed techniques of pattern analysis, some of which are 
being adapted for machine scoring. 

In an exploratory study, the Test of Diagnostic Skills was administered to 
100 junior medical students, 230 senior medical students, and 50 physicians, 
drawn from five medical schools (74, 75). The responses of these three groups 
differed in a number of ways. With regard to number of questions asked, the 
junior students asked the most and the physicians the least. These differences, 
furthermore, were largely restricted to items dealing with history and inter- 
view data. The junior students asked questions that were often irrelevant or 
redundant, or they tended to follow wrong clues. With the physician group 
as a criterion, utility scores computed for the students were higher for seniors 
than for juniors. Thus, in the specific questions asked, the seniors agreed more 
closely with physicians than did the juniors. These group differences, too, 
were found largely with questions pertaining to history and interview informa- 
tion. 

Fig. 120 Utility of acquired information as a function of order of questions 
for skilled and unskilled diagnosticians. (From Rimoldi, 74, p. 77.) 

Of special interest is the analysis of these utility values in relation to se- 
quence of questions. As the individual approaches the final diagnosis, the 
cumulative utility value of the information he has gathered rises. The more 
skillful diagnostician reaches a high utility value more quickly than does the 
poorer diagnostician, as illustrated in Figure 120. The cumulative index for 
student A rises almost as rapidly as is theoretically possible. That of student 
B, on the other hand, shows a plateau between questions 1 and 4. Student B 
accumulated practically no relevant information between these questions. 
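
The cumulative utility index just described can be sketched in a few lines of Python. The sketch rests on several assumptions that are not taken from the source: the item names, the criterion-group proportions, and the two question sequences are invented, and the scoring rule (utility of an item equals the proportion of the criterion group requesting it, summed in the order asked) is only one plausible reading of the procedure, not the published scoring method.

```python
from itertools import accumulate

# Hypothetical utilities: proportion of a criterion group (e.g., physicians)
# who requested each item. All values are invented for illustration.
UTILITY = {
    "chief_complaint_detail": 0.95,
    "pain_location": 0.90,
    "blood_count": 0.80,
    "family_history": 0.40,
    "diet": 0.10,
    "hobbies": 0.02,
}


def cumulative_utility(sequence):
    """Running total of utility over the questions, in the order asked."""
    return list(accumulate(UTILITY[item] for item in sequence))


def ideal_curve(n_questions):
    """Steepest possible curve: the highest-utility items asked first."""
    best = sorted(UTILITY.values(), reverse=True)[:n_questions]
    return list(accumulate(best))


# Hypothetical examinees: A asks the most useful items first, B does not.
student_a = ["chief_complaint_detail", "pain_location", "blood_count", "family_history"]
student_b = ["hobbies", "diet", "family_history", "blood_count"]

curve_a = cumulative_utility(student_a)
curve_b = cumulative_utility(student_b)
ideal = ideal_curve(4)

for k in range(4):
    print(k + 1, round(curve_a[k], 2), round(curve_b[k], 2), round(ideal[k], 2))
```

In this invented case the curve for examinee A coincides with the ideal curve, while that for examinee B stays nearly flat over the first two questions, which is the kind of plateau the text describes.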

Mention should also be made of current research on the possible uses of 
computers and other types of automated information-processing systems as aids 
in medical diagnosis (29, 70, 87, 93, 95). At the start, these techniques are 
being explored with reference to relatively specific diagnostic problems, such 
as the interpretation of records from electroencephalographs, electrocardio- 
graphs, radiographs, and polygraphs (29). Another preliminary approach in- 
volves statistical analyses of variables for the diagnosis of a single disease, such 
as coronary artery disease (95). Multiple regression equations can be utilized 
for this purpose, just as in the development of personnel selection batteries 
(see Chapter 3). Eventually, computer techniques will be applied to the 
processing of more complex and varied combinations of diagnostic data. 
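
As a rough illustration of how a multiple regression equation combines several diagnostic variables into a single predicted value, the following Python sketch fits least-squares weights by solving the normal equations. Everything specific in it is an assumption made for the example: the two predictor variables, the severity criterion, and the data are hypothetical, and the criterion was constructed from a known equation so that the fitted weights can be checked. It is not the procedure of the studies cited.

```python
def solve_linear_system(a, b):
    """Solve a x = b by Gaussian elimination with partial pivoting."""
    n = len(b)
    # Work on an augmented copy so the inputs are left unchanged.
    m = [row[:] + [b[i]] for i, row in enumerate(a)]
    for col in range(n):
        pivot = max(range(col, n), key=lambda r: abs(m[r][col]))
        m[col], m[pivot] = m[pivot], m[col]
        for r in range(col + 1, n):
            factor = m[r][col] / m[col][col]
            for c in range(col, n + 1):
                m[r][c] -= factor * m[col][c]
    x = [0.0] * n
    for i in range(n - 1, -1, -1):
        s = sum(m[i][j] * x[j] for j in range(i + 1, n))
        x[i] = (m[i][n] - s) / m[i][i]
    return x


def fit_multiple_regression(predictors, criterion):
    """Least-squares weights [intercept, b1, b2, ...] from the normal equations."""
    rows = [[1.0] + list(p) for p in predictors]
    k = len(rows[0])
    xtx = [[sum(r[i] * r[j] for r in rows) for j in range(k)] for i in range(k)]
    xty = [sum(r[i] * y for r, y in zip(rows, criterion)) for i in range(k)]
    return solve_linear_system(xtx, xty)


def predict(weights, predictor_values):
    """Apply the regression equation to one case."""
    return weights[0] + sum(w * v for w, v in zip(weights[1:], predictor_values))


# Hypothetical data: two indicators per patient and a severity rating built
# as 1 + 2*x1 - 0.5*x2, so the recovered weights can be verified.
patients = [(0.0, 0.0), (1.0, 0.0), (0.0, 1.0), (1.0, 1.0), (2.0, 1.0), (1.0, 3.0)]
severity = [1.0 + 2.0 * x1 - 0.5 * x2 for x1, x2 in patients]

weights = fit_multiple_regression(patients, severity)
print([round(w, 3) for w in weights])
print(round(predict(weights, (1.5, 2.0)), 3))
```

With real diagnostic data the criterion would of course not be an exact linear function of the predictors, and the weights would need cross-validation on a new sample, as with personnel selection batteries.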

Norms and Related Measurement Problems. Insofar as medical diagnosis 
utilizes measurement, it requires norms for the interpretation of results. In 
evaluating data on body weight, heart rate, blood pressure, biochemical anal- 
yses, or any other objective measure, the physician either explicitly or implicitly 
compares the results with those expected under normal conditions. This process 
requires a knowledge of typical results, as well as of the range of "permissible 
variation." Available norms, however, are often crudely expressed and based 
on inadequate samples and unsystematic surveys (60, 69, 87, 99, 103). 

In a 1951 article in the British Medical Bulletin, Wootton, King, and Smith 
(103) wrote: “Probably the most common use of blood analysis is in diagnosis. 
This involves comparing the results obtained from the patient being examined 
with those found in similar persons in normal health. It is therefore surprising 
how inadequate is the detailed information on the results obtained from normal 
subjects" (103, p. 307). In an effort to improve this situation, these investi- 
gators analyzed blood samples from 100 normal volunteers of both sexes be- 
tween the ages of eighteen and fifty. A few of the blood constituents measured 
in this sample yielded distributions closely approximating the normal curve, 
as illustrated in Figure 121; but the majority yielded highly skewed distribu- 
tions, as illustrated in Figure 122. Consequently, the use of means and standard 
deviations would have been inappropriate. Instead, the authors reported norms 
in terms of percentile measures, that is, the ranges covered by the middle 80 
per cent and by the middle 98 per cent of the cases. 
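
The contrast between percentile norms and norms based on the mean and standard deviation can be made concrete with a short Python sketch. The readings below are hypothetical, chosen only to be positively skewed, and the interpolation rule used for percentiles is one common convention rather than the one the cited authors necessarily employed.

```python
import statistics


def percentile(sorted_values, p):
    """Percentile by linear interpolation between adjacent ranks (0 <= p <= 100)."""
    if not sorted_values:
        raise ValueError("no data")
    position = (len(sorted_values) - 1) * p / 100.0
    lower = int(position)
    upper = min(lower + 1, len(sorted_values) - 1)
    fraction = position - lower
    return sorted_values[lower] + fraction * (sorted_values[upper] - sorted_values[lower])


def middle_range(values, coverage):
    """Limits enclosing the middle `coverage` per cent of the cases."""
    ordered = sorted(values)
    tail = (100.0 - coverage) / 2.0
    return percentile(ordered, tail), percentile(ordered, 100.0 - tail)


# Hypothetical, positively skewed readings in arbitrary units (invented data).
readings = [0.2, 0.2, 0.3, 0.3, 0.3, 0.4, 0.4, 0.4, 0.5, 0.5,
            0.5, 0.6, 0.6, 0.7, 0.8, 0.9, 1.1, 1.4, 1.9, 3.0]

mean = statistics.mean(readings)
sd = statistics.stdev(readings)
print("mean:", round(mean, 2), " median:", statistics.median(readings))
print("mean - 2 SD:", round(mean - 2 * sd, 2))  # negative for these data
print("middle 80 per cent:", middle_range(readings, 80))
print("middle 98 per cent:", middle_range(readings, 98))
```

For these invented readings the lower limit obtained from the mean minus two standard deviations is negative, an impossible value for a concentration, whereas the percentile limits necessarily fall within the range of values actually observed.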

Published tables of biological norms frequently give means and variability 
estimates that may be misleading because of the form of the distribution. With 
skewed distributions, the median or mode would be more appropriate than the 
mean as a measure of central tendency. Normal range is often estimated by 
taking a distance of approximately two standard deviations on either side of 
the mean, with no knowledge of normality of distribution. For some physio- 
logical variables, moreover, the published estimates of range are based, not on 
data analysis, but on the subjective judgment of an investigator "experienced 
in measuring the quantity in question" (89). 

Fig. 121 Distribution of a blood constituent in a normal sample: plasma 
albumin. (From Wootton, King, and Smith, 103, p. 307.) 

Fig. 122 Distribution of a blood constituent in a normal sample: bilirubin. 
(From Wootton, King, and Smith, 103, p. 308.) 

Apart from the possible use of inappropriate statistical measures, available 
biometric norms have other deficiencies. The samples from which they are 
derived are often small and unrepresentative. When compared with the 
normative samples employed in standardizing psychological tests, most of the 
samples utilized in biometric surveys appear quite inadequate. Published ranges 
of the concentration of different blood constituents, for example, are often 
based on samples of fifty cases or less, and sometimes on as few as five cases 
(99, p. 572). 

The need for systematic medical norms stems from the wide variability found 
among normal persons. In a book entitled Biochemical Individuality, Williams 
(99) reports data gathered in his own laboratories, as well as published find- 
ings of other investigators, showing wide individual differences in metabolic 
patterns, as indicated by the chemical composition of saliva and urine. In the 
same book, he cites the extensive individual differences observed in virtually 
all anatomical and physiological variables, from stomach shape and size to 
heart rate. In his writings, Williams has repeatedly urged that medical prac- 
tice pay more heed to individual differences in bodily functions and that 
adequate data regarding such differences be collected. 

Among the most extensive collections of biometric data are those pertaining 
to height and weight (69). Many thousands of cases have been included in 
the routine measurement of infants, schoolchildren, military recruits, inmates 
of prisons and other institutions, and applicants for life insurance policies. 
Special groups such as industrial workers and students have also been surveyed. 
Despite the impressive size of these samples, however, representative data are 
not available for certain segments of the population, such as adult women. For 
medical use, moreover, weight norms need to be specified with reference to 
age, sex, height, and possibly other anthropometric indices, such as chest cir- 
cumference. For such detailed breakdowns, representativeness of sampling is 
even more important than for general group comparisons. The variability of 
norms in different regions and at different times must also be taken into ac- 
count. Temporal changes in physical characteristics of populations call for 
periodic revisions of norms (69). 

In the diagnostic use of quantitative data, some attention should also be 
given to errors of measurement. These errors result in part from temporal 
fluctuations in the individual’s own condition. For many functions, systematic 
data on normal intraindividual fluctuations are lacking (87). Variations also 
occur in the results obtained by different examiners or by different laboratories 
that perform diagnostic tests (87, 103). All these types of measurement error 
are analogous to those assessed by the reliability coefficients of psychological 
tests, and similar procedures could be used to estimate their magnitudes. A 
knowledge of the measurement error inherent in diagnostic measurements is 
essential for their proper interpretation. In addition, the extent of such measure- 
ment error can often be reduced by controlling sources of variation through 
further standardization of procedures. An obvious example is that of controlling 
the previous exertion and present emotional state of the patient in measuring 
heart rate. 
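
The analogy with test reliability can be illustrated by a brief Python sketch that treats two examiners' readings on the same patients as parallel measurements. The heart-rate figures are hypothetical, and the use of the product-moment correlation as the reliability estimate, together with the familiar formula for the standard error of measurement (the standard deviation multiplied by the square root of one minus the reliability), is an assumption carried over from psychometric practice rather than a procedure reported in the studies cited.

```python
import math
import statistics


def pearson_r(x, y):
    """Product-moment correlation between two paired sets of readings."""
    mx, my = statistics.mean(x), statistics.mean(y)
    sxy = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sxx = sum((a - mx) ** 2 for a in x)
    syy = sum((b - my) ** 2 for b in y)
    return sxy / math.sqrt(sxx * syy)


def standard_error_of_measurement(scores, reliability):
    """SEM = SD * sqrt(1 - r), the usual psychometric formula."""
    return statistics.stdev(scores) * math.sqrt(1.0 - reliability)


# Hypothetical heart-rate readings (beats per minute) for the same ten
# patients taken by two examiners; the numbers are invented.
examiner_1 = [62, 70, 75, 68, 80, 91, 66, 73, 85, 77]
examiner_2 = [64, 69, 78, 66, 83, 88, 70, 72, 82, 79]

r = pearson_r(examiner_1, examiner_2)
sem = standard_error_of_measurement(examiner_1, r)
print("interexaminer reliability:", round(r, 3))
print("standard error of measurement:", round(sem, 2), "beats per minute")
```

A reading would then be interpreted as a band of roughly one or two standard errors on either side of the obtained value, rather than as an exact figure.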

In many diagnostic measures the distributions of healthy and pathological 
cases overlap. Empirical determination of the extent of such overlap would 
obviously be helpful. Since diagnosis is usually based on a combination of 
measures, statistical studies of combinations and patterns of variables would 
likewise be fruitful (103). Finally, there is need for more longitudinal studies 
to determine the predictive validity of diagnostic measures. In the previously 
mentioned article, Wootton, King, and Smith (103) cite a longitudinal study 
conducted with infants suffering from haemolytic disease of the newborn, a 
condition resulting from blood-group incompatibility of mother and offspring. 
Several indices of the severity of the disease were checked against subsequent 
mortality. The most valid proved to be the haemoglobin concentration in the 
blood of the umbilical cord. With this index, an expectancy graph was pre- 
pared with which an infant's chances of survival could be estimated. These 
estimates are useful, not only in individual prognosis, but also in evaluating 
the effectiveness of different therapies. 
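
An expectancy table of the kind underlying such a graph can be computed very simply, as the following Python sketch suggests. The follow-up records, the units, and the band limits are all hypothetical and are not the data of the study cited; the sketch shows only the general technique of tabulating the proportion of favorable outcomes within successive ranges of a predictor.

```python
def expectancy_table(cases, band_edges):
    """Proportion surviving within each band of the index.

    cases: list of (index_value, survived) pairs.
    band_edges: ascending cut points; band i covers [edge_i, edge_{i+1}).
    """
    table = []
    for low, high in zip(band_edges, band_edges[1:]):
        outcomes = [survived for value, survived in cases if low <= value < high]
        rate = sum(outcomes) / len(outcomes) if outcomes else None
        table.append((low, high, len(outcomes), rate))
    return table


# Hypothetical follow-up records: (cord haemoglobin, survived?). Invented
# for illustration only.
records = [
    (5.1, False), (6.0, False), (7.2, True), (7.9, False),
    (8.5, True), (9.4, False), (10.2, True), (11.0, True),
    (11.8, True), (12.5, True), (13.1, False), (14.0, True),
    (15.2, True), (16.0, True), (17.3, True), (18.1, True),
]

table = expectancy_table(records, [4, 8, 12, 16, 20])
for low, high, n, rate in table:
    shown = "n/a" if rate is None else f"{rate:.2f}"
    print(f"{low:>2}-{high:<2}  n={n}  proportion surviving={shown}")
```

With samples as small as those in this illustration, the proportions in adjacent bands fluctuate irregularly; in practice the bands would be chosen, and the sample enlarged, so that each proportion rests on an adequate number of cases.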

It is apparent that the collection and analysis of quantitative data for use in 
medical diagnosis involve many concepts and techniques familiar in psycho- 
metrics. It would thus seem that the collaboration of psychometrically trained 
psychologists with medical statisticians, pathologists, and other medical special- 
ists should prove fruitful. So far, however, the contributions of psychologists 
to this area of medicine have been minimal. 


PSYCHOSOMATIC MEDICINE 


Since midcentury, the term "psychosomatic" has become familiar, not only 
in medical and psychological literature, but also in popular speech. Much 
publicity has been given to such "psychosomatic illnesses" as peptic ulcer, high 
blood pressure (essential hypertension), rheumatoid arthritis, and bronchial 
asthma. Conceptually, psychosomatic disorders can be differentiated from 
malingering, in which the individual deliberately fakes or falsely reports a 
symptom to gain some ulterior end. They can also be differentiated from such 
neurotic symptoms as hysterical paralyses and loss of sensitivity, for which no 
organic basis can be found (see Chapter 13). The term psychosomatic is 
typically applied to those physical disorders in which emotional stresses have 
led to identifiable organic pathology. We must hasten to add, however, that 
some writers use the term more broadly and with varied emphases. Moreover, 
the above threefold distinction is not always easy to apply in practice. And any 
individual patient may exhibit symptoms classifiable under two or even all 
three types. 

Historical Development. The concept of psychosomatic illness is an old 
one, dating from some of the earliest recorded writings (see 100, ch. 3). Even 
the term psychosomatic is not very new, having been proposed at least as early 
as 1818 (100, p. 41). Nevertheless, the current psychosomatic movement in 
medicine and the widespread recognition of psychogenic factors in physical 
disorders is of recent origin. The publication in 1935 of Dunbar's book on 
Emotions and Bodily Changes (21), which surveyed relevant world literature 
of both experimental and clinical nature, did much to launch the movement. 
The Journal of Psychosomatic Medicine began publication in 1939, and the 
Psychosomatic Society¹ was organized in 1942. The high incidence of
psychosomatic disorders among military personnel and among civilians exposed to
wartime stresses during World War II also contributed to the growing concern 
with these problems. 

Several antecedent developments in psychology, physiology, and psychiatry 
contributed to the rise of the modern psychosomatic movement. A major con- 
tribution stems from research on the effects of psychological factors upon 
physiological functions, as studied in both man and animals. The pioneer 
animal experiments of Pavlov (72) on the conditioned salivary reflex and of 
Cannon (13) on physiological effects of intense emotional stimuli are out- 
standing examples of this research. The more recent investigations of Selye 
(83, 84) on the "general adaptation syndrome" following prolonged exposure 
to any kind of stress have thrown further light upon the role of certain 
hormones in mediating physiological responses to stress. More limited
experimental studies on human subjects as well as clinical observations on patients
provided additional data on changes in blood pressure, respiration, gastric
motility, glandular secretions, and other physiological reactions resulting from
emotional stress. Such research has established that temporary and reversible
physiological changes accompany anxiety and other emotional states. In
addition, there is evidence that excessive and continued emotional stress may lead
to chronic disturbances of physiological functions and to permanent tissue
damage.

¹ When first established, this society was known as the Society for Research in
Psychosomatic Problems. In 1950, the name was changed to Psychosomatic Society
(21, p. 732).

Among the other developments that contributed to the emergence of
psychosomatic medicine may be mentioned constitutional type theories (see 2, ch. 6).
These typologies have a long history but are best known to psychologists 
through the work of Kretschmer (53) and Sheldon (85). The earliest exposi- 
tions of constitutional type theories were concerned with the relation between 
body build and susceptibility to different diseases. In later formulations, body 
build came to be associated with characteristic personality patterns as well as 
with proneness to specific diseases. Such theories thus helped to stimulate in- 
vestigations into the personality characteristics of persons suffering from given 
diseases. 

Still another major influence, especially evident in psychiatric writings on 
psychosomatic medicine, is psychoanalysis (see 21, 36). Beginning with Freud, 
many psychoanalysts have proposed theories regarding the role of conflicts, 
repressions, and other psychoanalytic mechanisms in the development of physi- 
cal diseases. These theoretical speculations, bolstered by selected examples 
from clinical practice, have often led to descriptions of the typical personality 
of patients suffering from different psychosomatic disorders. Such popular 
phrases as "ulcer type" and "coronary personality" have their origins in these 
theories. Some psychoanalysts have approached the problem through the sym- 
bolic psychoanalytic significance of different organ systems, which allegedly 
determines the specific psychosomatic disorders each person is likely to develop. 

Research Methodology. Current research on psychosomatic problems re- 
flects the influence of these varied approaches. Following the earlier explora- 
tory work, animal experiments have been more directly concerned with the 
production and alleviation of specific psychosomatic symptoms (1; 7; 11; 12; 48,
pp. 49-51, 63-69; 78; 79; 96). By exposing rats, dogs, cats, monkeys, and other
animals to conflictual and stressful situations, experimenters have been able to
induce peptic ulcers, asthmalike symptoms, and other familiar psychosomatic
disorders.

In one such experiment (11, 12), pairs of monkeys were placed in restraining
chairs where they could be given brief electric shocks on the feet every 20
seconds. By pressing a lever at least once during each 20-second interval, the
experimental member of the pair (facetiously designated as the "executive"
monkey) could avoid the shock for both. Figure 123 shows the experimental
and control monkeys in their restraining chairs. Both have levers within easy
reach, but the lever on the side of the control animal is inactivated. Although
both animals were subjected to the same physical stress, consisting of equally
frequent shocks of the same intensity, only one was under the additional
psychological stress of having to press the lever. Physical observations both during
the experiment and after death revealed marked gastrointestinal ulceration in
the experimental monkeys, while the control animals showed no gastrointestinal
complications.

Fig. 123 Monkeys participating in experimental investigation of peptic
ulcers. Animal on the left can avoid shock for both by pressing lever; control
animal's lever is inactivated. (See Brady, 11, 12; United States Army
photograph.)

It is thus apparent that neither the restraint nor the shocks employed in
this experiment could account for the ulceration. The essential factor centered
in the responses required of the experimental member of the pair. Physiological 
records indicated an excess secretion of gastric acid in the experimental mon- 
keys, which was the immediate physiological cause of ulcer development. The 
"rest" and "work" schedules followed in the experiment also appeared to be an 
important factor in the situation. When six-hour rest periods with the shock 
turned off were alternated with six-hour work periods during which the animal 
must press the lever to avoid shock, most experimental monkeys developed 
ulcers. With other schedules, requiring either longer or shorter work periods, 
ulcers failed to develop. It is possible that the relation of the stress periods to 
some physiological cycle is a relevant factor, but more research is needed to 
identify the exact etiological mechanism. 

Short-term experiments of a milder nature have been performed with human 
volunteers (48, pp. 48-49; 100-102). The subject is exposed to a stressful 
stimulus while various physiological measures are recorded and compared with
those obtained under emotionally neutral circumstances. In clinical situations,
the discussion of an emotionally charged topic may serve as the stressful
situation. In such experiments, it is essential that the situation be one that has
genuine emotional significance for the particular subject. This requires
considerable knowledge about the individual's attitudes, values, and prior
experiences. With suitable emotional stimuli, physiological changes have been
demonstrated in a variety of bodily functions, including respiration, blood pressure,
heart rate, sweat secretion, muscle potentials in different parts of the body, 
and acid secretion in the stomach. 

Of particular interest are the studies of a few patients who as a result of 
injury or for therapeutic reasons have a gastric fistula that permits direct ob- 
servation of the gastric mucosa, or inner lining of the stomach (24, 101). 
Figure 124 illustrates results obtained with one of these patients during a 
conversation that aroused anger and hostility. It can be seen that stomach
contractions were intensified, hydrochloric acid production was more than doubled,
and the gastric mucosa reddened as it became engorged with blood. 

A third and related approach consists in the longitudinal observation of
patients with psychosomatic disorders. Through such continuing observations, 
periods of remission or exacerbation of the physical symptoms can be correlated 
with life experiences. Published reports of such studies indicate a tendency for 
the patient's physical condition to worsen during periods of emotional stress. 
Because of the difficulty of maintaining contact with patients for long periods, 
an adaptation of this method utilizes case-history data (19). Some practitioners 
employ a "life chart" in which each row corresponds to a year in the patient's 
life. In appropriate columns on this chart, they record the patient's medical 
condition and any significant life events occurring during the year. Again pub- 
lished records indicate fairly close correspondence between experiential factors 
and the course of such disorders as peptic ulcer and rheumatoid arthritis (19). 
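
The life-chart procedure can be made concrete with a brief sketch in Python. The record layout, the field names, and every entry in the sample chart below are hypothetical illustrations, not data from the studies cited (19). The tally simply compares how often symptoms worsened in years with and without a recorded life event.

```python
from dataclasses import dataclass, field
from typing import List, Optional, Tuple


@dataclass
class LifeChartRow:
    """One row of a life chart: a single year in the patient's life."""
    year: int
    medical_condition: str                 # free-text note on physical state
    life_events: List[str] = field(default_factory=list)
    symptom_flare: bool = False            # did physical symptoms worsen?


def flare_rates(chart: List[LifeChartRow]) -> Tuple[Optional[float], Optional[float]]:
    """Return the proportion of years with a symptom flare, computed
    separately for years with and without a recorded life event."""
    def rate(rows: List[LifeChartRow]) -> Optional[float]:
        if not rows:
            return None                    # no years of this kind on the chart
        return sum(r.symptom_flare for r in rows) / len(rows)

    with_events = [r for r in chart if r.life_events]
    without_events = [r for r in chart if not r.life_events]
    return rate(with_events), rate(without_events)


# Hypothetical chart for an imaginary ulcer patient (invented entries).
chart = [
    LifeChartRow(1950, "no symptoms"),
    LifeChartRow(1951, "ulcer symptoms", ["loss of job"], True),
    LifeChartRow(1952, "remission"),
    LifeChartRow(1953, "ulcer symptoms", ["death in family"], True),
    LifeChartRow(1954, "mild symptoms", ["promotion"], False),
]

stressful_rate, quiet_rate = flare_rates(chart)
print(stressful_rate, quiet_rate)
```

With so few rows the comparison is merely descriptive; it shows the bookkeeping involved, not a test of any hypothesis.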


Fig. 124 Changes in gastric mucosa observed in a subject with a gastric
fistula during an emotionally stressful situation. (Wolf and Wolff, 101,
p. 298. From Contributions toward medical psychology, edited by Arthur
Weider. Copyright 1953, The Ronald Press Company.)

The analysis of life charts is less satisfactory than direct longitudinal
observation, since it must rely upon the patient's report, and it leaves room for selective
bias on the part of both patient and investigator. Both methods, moreover, are 
limited in that contact with the subject occurs after he is in an advanced stage 
of some somatic disorder. Physiological studies of individuals in the process of 
responding to life stresses, prior to the onset of pathology or during an early 
stage in the development of the disorder, are more illuminating, though more 
difficult to conduct. One such longitudinal study of Army inductees during 
World War II demonstrated that both physiological and personality variables 
contribute to the development of ulcers under stressful conditions (97). 

From a different angle, psychiatrists and psychologists have investigated the 
personality characteristics of patients suffering from a specific physical disorder. 
Much of the literature on psychosomatic medicine is concerned with the results 
of this type of study (see 21, 36, 41, 57, 61). Many of these published per- 
sonality descriptions are derived from selected cases observed under uncon- 
trolled clinical conditions. Data were usually gathered by interviewing. When 
tests were employed, they were often of questionable validity. More objective 
studies with random samples and moderately well-matched control groups 
have usually failed to confirm the hypothesized relations between personality 
variables and type of disorder. 

A further difficulty in the application of this method is the impossibility of 
inferring causal relations. If, for example, coronary patients manifest excessive 
anxiety, we do not know whether this condition existed prior to the onset of 
the physical disability or whether it is the patient's reaction to his physical 
handicap and to the frustrations it imposes. Only a longitudinal study begun 
prior to the onset of the physical disorder could answer this question. A further 
complication arises from the possible behavioral effects of reduced blood supply 
to the brain, changes in blood content, and other physiological aspects of the 
disease itself (31, pp. 101-102). Drugs administered to treat the physical
disorder may also influence emotional and intellectual responses.

A fifth approach, which is receiving increasing attention, is that of epide- 
miology. By comparing the relative frequency of a disorder in different segments 
of the population, the epidemiologist can test hypotheses about causal
influences (15). Analyses of the prevalence of certain diseases in different
occupational groups provide one example of such an approach. It is noteworthy
in this connection that surveys conducted in several countries have found peptic 
ulcer to be significantly more common among foremen than among craftsmen 
or higher-level executives (22). Contrary to popular belief, it is not the top- 
level executive who is most likely to develop ulcers. Although much more 
information is needed to arrive at a causal interpretation of such findings, one 
of the proposed explanations is that role conflicts may increase the emotional 
stress of the foreman’s job. 
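
The arithmetic behind such occupational comparisons can be sketched in a few lines of Python. The counts below are invented solely for illustration and are not taken from the surveys cited (22). The function names are likewise hypothetical. The sketch computes the prevalence of a disorder in each group and the ratio of one group's prevalence to another's.

```python
# Hypothetical counts: (cases of peptic ulcer, persons surveyed) per group.
# These figures are invented for illustration; they are NOT from reference (22).
SURVEY = {
    "foremen": (30, 1000),
    "craftsmen": (15, 1000),
    "executives": (12, 1000),
}


def prevalence_per_thousand(cases, surveyed):
    """Number of cases per 1,000 persons surveyed."""
    return 1000 * cases / surveyed


def prevalence_ratio(group, reference, survey=SURVEY):
    """Prevalence in `group` divided by prevalence in `reference`."""
    g_cases, g_n = survey[group]
    r_cases, r_n = survey[reference]
    return (g_cases / g_n) / (r_cases / r_n)


rates = {name: prevalence_per_thousand(*counts) for name, counts in SURVEY.items()}
foremen_vs_craftsmen = prevalence_ratio("foremen", "craftsmen")
foremen_vs_executives = prevalence_ratio("foremen", "executives")

for name, rate in rates.items():
    print(f"{name}: {rate:.1f} per 1,000")
print(foremen_vs_craftsmen, foremen_vs_executives)
```

A ratio of this kind only describes a difference in relative frequency. Whether the difference is statistically significant, and what produces it, are separate questions that the ratio by itself cannot answer.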

Another application of the epidemiological approach is illustrated by a series 
of studies in progress in a recently industrialized area of North Carolina (15; 
16). In these studies, the investigators set out to test the hypothesis that the 
incongruities and conflicts presented by cultural change, as in migration or in 
the industrialization of a farming community, produce stresses likely to result 
in increased rates of psychological, somatic, and social ill health. Over the 
preceding half century, the community under investigation had changed from 
a mountain village of approximately one hundred population to a small
industrial city of about five thousand population. Since the population had
remained racially quite homogeneous and was drawn almost wholly from the
nearby farming area, this community provided a relatively controlled situation
for study. As hypothesized, the “second-generation” factory workers, whose
fathers were also employed in industry, had fewer health problems than the
“first-generation” employees, whose fathers were farmers. The emotional stresses
accompanying the readjustments and culture conflicts of these first-generation
factory workers were thus reflected in poorer health conditions (16). 

Current Trends. Today it is being increasingly recognized that the
relation between emotional stress and somatic disorders is more complex than that
implied in the personality-type theories. Because of methodological weaknesses, 
studies purporting to show the association of characteristic personality profiles 
with specific psychosomatic syndromes are of questionable value. An examina- 
tion of the proposed personality descriptions, moreover, reveals considerable 
similarity among the traits attributed to persons with different diseases, such as 
peptic ulcer, hypertension, and rheumatoid arthritis (see, e.g., 41, pp. 205-208).
Empirical observations, in fact, suggest that any given psychosomatic
symptom could be the expression of several different emotions and that a given 
emotion could lead to a variety of psychosomatic symptoms (36, ch. 5). Such 
a finding is also consistent with present knowledge about the transient physio- 
logical expressions of emotional states (56). It is likewise relevant to note that 
shifts from one type of symptom to another, as from asthma to eczema, are not 
unusual (36, p. 66; 41, p. 203). 

A parallel trend in psychosomatic medicine is to regard all diseases as having 
both psychogenic and somatogenic components, rather than labeling a few as 
psychosomatic. This trend is illustrated by the extension of psychosomatic medi- 
cine to include more and more specialties, such as surgery, gynecology, ophthal- 
mology, and dentistry (21, chs. 1 and 17; 36). But, at a more basic level, the 
concept of psychosomatic medicine is being broadened to mean a consideration 
of the total reacting organism in his environment. Psychosomatic medicine thus 
stands for a point of view applicable to the practice of all medicine, rather 
than representing a branch of medicine. This approach will be recognized as 
virtually identical with that of comprehensive medicine, described earlier in this 
chapter. 


SOMATOPSYCHOLOGY 


General Nature. Psychosomatic medicine began with investigations of the 
somatic effects of such psychological factors as emotional stress. Beginning at 
the opposite angle, some psychologists have been studying the individual's 
psychological reactions to physical illness or disability. Barker and his associates 
(5, 6, 105) have coined the term "somatopsychology" for this field. Although
this is a new and relatively unexplored research area, it has attracted wide- 
spread interest through its practical implications for rehabilitation programs. 

In somatopsychology, the individual's body is regarded "as a tool for
behavior, and as an object with social significance to himself and others" (6,
p. 419). For example, an orthopedic disability that seriously affects locomotion
and other motor responses will interfere with social interaction, with schooling, 
and with other normal childhood activities. Such experiential limitations,
unless remedied by other means, will retard intellectual growth and alter
personality development in various ways. Another way in which conspicuous
physical anomalies may affect psychological development is through the reac- 
tions of associates. Pity, avoidance, oversolicitousness, rejection, repulsion, and 
many other emotionally charged attitudes tend to make the disabled indi- 
vidual's social environment different from that of physically normal persons. 
The individual's self concept and his own attitude toward his disability are 
likely to be profoundly affected by his atypical experiences. 

Long-term Psychological Effects. Exploratory studies have been con- 
ducted on the psychological effects of many types of physical disability, in- 
cluding deafness, blindness, amputation, facial disfigurement, tuberculosis, 
poliomyelitis, cerebral palsy, multiple sclerosis, and paraplegia, among others 
(5, 30, 31, 86, 105). Patients in these different categories have been
investigated through intensive case studies, interviews, questionnaires, and
psychological tests. An example of a projective technique developed especially for
studying the attitudes of amputees toward their disability and toward the 
wearing of artificial limbs is given in Figure 125. Patterned after the 
Rosenzweig Picture-Frustration Study, this test consists of nine cartoons por- 
traying potentially awkward or embarrassing social situations that may be 
encountered in daily life by an arm amputee (73, 86). The subject’s responses 
are scored with regard to self-acceptance, reality facing, and desire for in- 
dependent function. 

The results of such studies show that there is no consistent association be- 
tween specific forms of disability and particular personality characteristics. 
There are no psychological traits that uniformly differentiate, let us say, the 
blind, the deaf, or the orthopedically handicapped from persons with other 
disabilities or from physically normal persons. As a group, the severely dis- 
abled tend to show some maladjustment, in the form of anxiety, depression, 
and social isolation. Even these group differences, however, are slight, and the 
overlapping with physically normal persons is extensive.

Studies of physically disabled individuals, on the other hand, reveal that the 
physical disability often has a profound effect upon behavior. The discrepancy 
between individual and group results arises from the fact that the psychological 
effect of disability takes many different forms in different persons. The end 
result in the individual’s behavior depends upon a three-way interaction among: 
(1) nature and severity of handicap; (2) individual personality and ability 
variables; and (3) situational variables in home, school, job, and social
contexts (5; 6; 105, ch. 15). Heterogeneity in the effects of specific physical
disabilities upon individual behavior is one of the best-established findings of
somatopsychology. The individual's attitude toward his disability is an im- 
portant link in the chain of circumstances leading to particular behavioral 
effects. 

Immediate Reactions to Illness. While the above research pertains to
long-range and relatively permanent effects of physical disability upon intel- 
lectual and personality development, several investigators have been concerned 
with the individual's immediate response to illness, hospitalization, and treat- 
ment. This approach is illustrated by an extensive series of studies undertaken 
cooperatively by psychologists in a number of Veterans Administration hospi- 
tals. In the first of these studies, data were obtained from nearly eight hundred 
male patients hospitalized for pulmonary tuberculosis (94). This disease was
chosen because it presents a particularly difficult adjustment problem during
treatment. Because the disease is contagious, the patient must be isolated.
Treatment is slow and frustrating, the patient being hospitalized for long
periods during which he may feel perfectly fit. In addition, the threats of
possible lung surgery and permanent disability evoke anxiety and depression.

Hostess (who knows nothing about the guest):

I am so glad you were
able to come. Can I help
you with some coffee
and cake?

Check One
1. Thank you. I shall help myself to coffee and cake. ___
2. Thank you. I shall try to manage both, but if not would you
mind if I find a small table? ___
3. Thank you. I'll have some coffee. (Cake seems too hard to
handle, John thinks to himself.) ___
4. Thank you. Would you mind helping me with the cake? I'll
take the coffee. ___
5. Thank you. I'll appreciate it if you would set the coffee and
cake down for me. ___
6. No thank you. (I'd like some, but what's the use, John
thinks to himself.) ___
7. Thank you. Would you mind helping me? It would be
somewhat easier. ___
8. Thank you. I'll try and manage. (John thinks to himself,
I'll bet she feels sorry for me.) ___

Fig. 125 Typical item from projective test designed to assess attitudes of
amputees toward their disability and toward the use of prosthetic devices.
(From Peizer, 73, p. 52.)

In the study, rating scales, interviews, and questionnaires were used to 
arrive at criterion indices of hospital adjustment and of subsequent community
adjustment. These indices were correlated with biographical data and with
scores on a battery of psychological tests. One of the suggestive findings of the 
study was that the type of person who adjusts well to the hospital tends to 
make a poorer community adjustment after discharge. Conversely, those making 
a good community adjustment tended to be more resistant to the regimentation 
and inactivity typical of the tuberculosis ward. The passivity, acquiescence, 
and dependence conducive to good hospital adjustment contrast sharply with 
the personal qualities demanded by the posthospital environment. Only one 
psychological variable, anxiety, was found to be detrimental to both hospital 
and community adjustment. 

A more intensive investigation of 32 patients in the tuberculosis division of 
a county hospital was conducted by Derner (20). Information was obtained 
through a series of guided interviews, self-report inventories, and projective 
techniques. The results gave no evidence of a characteristic personality pattern 
associated with tuberculosis, although feelings of apprehension, fearfulness, 
and depression were common. The patients’ responses also indicated a pervasive 
concern with their illness and with bodily condition in general. An attempt to 
check on the alleged euphoria and undue optimism of tubercular patients 
yielded inconclusive results. Several findings pointed to an underlying conflict 
between dependency needs and independency needs in the course of hospital- 
ization and rehabilitation. Patients differed widely in their response to this 
conflict, as was also suggested by the findings of the more recent and more 
extensive Veterans Administration study cited above. 

Patients’ reactions to a diagnosis of cancer and to their subsequent hospital- 
ization and treatment also provide a fruitful field for psychological research 
(18). Because of the high death rate from some forms of this disease, insuffi- 
cient knowledge about its nature, the possibility of disfiguring surgery, and the 
many superstitious beliefs associated with it, cancer engenders unusually high 
anxiety in most patients. Attitudes of relatives also play an important part in 
the patient's reaction to this disease. The psychologist in a cancer hospital can 
do much to reduce misunderstandings and allay anxieties. 

Even with terminal cancer patients, psychological counseling for the accept- 
ance of death has much to contribute to the patient’s emotional well-being 
(18). To perform effectively in this role, the psychologist needs much more 
information than is now available about individuals’ attitudes toward their 
own death. This is an area of psychological research that has been largely 
neglected, but its importance is gradually coming to be recognized by
psychological investigators. A beginning has been made in exploring attitudes toward
death held by individuals at different age levels, from early childhood to old 
age (26, chs. 6-8). Surveys of terminally ill patients suggest, among other 
findings, that feelings of being abandoned and rejected often augment the 
patient's dread (18; 26, ch. 8). As might be anticipated, a given patient's 
reaction to imminent death depends upon many individual and situational 
variables. But most patients want to discuss their impending death and resent 
evasiveness and withdrawal by their associates. 

Relation between Patient Attitudes and Recovery. Still another type of 
investigation is concerned with the effect of patient attitudes upon recovery 
from illness. In the previously mentioned study of patients with pulmonary 
tuberculosis in Veterans Administration hospitals, biographical and test data
were also correlated with indices of recovery as determined from X-ray films
and bacteriological tests (94). Favorable response to treatment was found to 
be significantly correlated with freedom from anxiety and with personal con- 
fidence. Rapid recovery also tended to be associated with an "experimenting," 
intellectual orientation as contrasted with a cautious and conservative attitude. 
Although these specific findings should be regarded as tentative and pre- 
liminary, the fact that patients in identical physical condition may respond 
very differently to treatment has long been recognized in clinical practice. 

Similar observations have been reported by physicians treating cancer 
patients, and some exploratory research has been undertaken to identify psy- 
chological characteristics associated with differential recovery rate. Independent 
research projects conducted at different hospitals have provided suggestive 
evidence that patients' attitudes toward their illness and other personality 
characteristics are related to the likelihood of recovery from certain forms of 
cancer (18, 32). Findings from joint psychological and biochemical research 
suggest the role of hormones as a possible mediating factor in this relationship. 
Although available information is far too meager to permit any conclusions at 
this time, preliminary results are sufficiently promising to justify further re- 
search along these lines. 

With these studies on the effect of attitudes upon recovery from physical 
illness we have come full circle and are back to a psychosomatic relationship. 
By now it should be apparent that the separation of different types of relation 
between psychological factors and physical illness, while desirable for purposes 
of analysis, cannot be sharply maintained in practice. The way in which an 
individual adjusts to the complex changes in his life pattern brought about by 
illness or physical disability depends to a large extent upon his pre-disability 
personality. If his reaction includes anxiety and depression, these attitudes will 
in turn exert a psychosomatic effect upon the subsequent course of the illness, 
as well as upon his ability to cope with the social, economic, and other problems 
that it engenders. Both psychosomatic medicine and somatopsychology can 
thus be recognized as two aspects of the general approach designated as 
comprehensive medicine. 


SUMMARY 


Although the role of psychological factors in medical practice has been recog- 
nized for centuries, it was not until after World War II that contacts between 
psychology and medicine attained significant proportions. Even now, the po- 
tential contributions of psychology to medicine are largely undeveloped. In 
medical practice, psychology can aid in diagnosis, treatment, rehabilitation, 
and prevention. Apart from his utilization of psychodiagnostic data, the physi- 
cian can profit from psychological knowledge in conducting medical interviews 
and in evaluating introspective reports by patients. In treatment, the role of 
interpersonal relations between physician and patient is widely recognized. The 
trend toward comprehensive medicine, with its emphasis upon the whole 
patient in his environment, has advanced the application of psychology to 
medicine. In rehabilitation, the psychologist typically functions as a member of 
a team composed of physicians and other professional workers. Preventive
medicine requires effective two-way communication between lay public and
public health personnel. Its problems thus have much in common with those 
of consumer psychology. 

Since midcentury, the participation of psychologists in medical school
teaching has grown rapidly, as manifested both by the increasing number of
psychologists so employed and by the broadening scope of their functions. A major
contribution that psychology can make to medicine is in research. Examples 
of collaborative psychological research conducted in medical settings include 
electroencephalography, studies of environmental stresses of flight and space 
travel, psychopharmacology, investigations of the nature and relief of pain, 
analyses of the process of medical diagnosis, and the improvement of physio- 
logical norms and of diagnostic measurement. 

Modern psychosomatic medicine dates from the mid-thirties. Its antecedents 
stem from such varied sources as physiological psychology, constitutional type 
theories, and psychoanalysis. Current research techniques include the artificial 
production of psychosomatic disorders in animals, experimental investigation 
of the effects of stress upon human physiological functions, longitudinal
observations and case studies of patients with psychosomatic disorders, and
personality studies of patients with different physical syndromes.

Somatopsychology begins from the opposite angle, focusing upon the psycho- 
logical effects of physical illness and disability. It is concerned with both long- 
range effects upon intellectual and personality development and immediate 
responses to illness, hospitalization, treatment, and impending death. Some 
suggestive data have been reported on the effect of patients’ attitudes and
other personality characteristics upon recovery from such illnesses as tuber- 
culosis and cancer. Both somatopsychology and psychosomatic medicine may be 
regarded as particular aspects of comprehensive medicine.


Psychology and Law


With its heavy reliance upon precedent, the practice of law is character- 
istically resistant to change. Current laws as well as judicial procedures are 
often based upon outdated theories of human behavior and are at variance 
with experimentally established facts. This cultural inertia is generally de- 
fended on the grounds that law, dealing as it does with matters of life and 
death and other momentous decisions, must move cautiously. Thus it is argued 
that “new” findings, concepts, and techniques provided by a rapidly developing 
science such as psychology cannot be accepted as guides in legal practice until 
they have become so firmly established as to meet with virtual unanimity 
among psychologists themselves. 

The effect of this policy, however, is that the psychological principles ac- 
tually followed in legal practice are sometimes so manifestly obsolete as to be 
almost unanimously rejected by psychologists. In all practical action, decisions 
must be made in the absence of complete certainty. Under such circumstances, 
wisdom requires that all the information available at the time be utilized in 
choosing the best course of action. Modern medicine provides a good example 
of this approach. For most diseases, there is no therapy that guarantees 100 
per cent success. Nor would there be complete agreement about the best 
therapy to follow in any particular case. Yet this state of affairs is not used to 
justify a continuation of the medical practices of the Middle Ages.

Today there are several indications within the field of law that efforts are
being made to bring legal practice into closer touch with modern society and 
with developments in both natural and social sciences. Among law professors, 
jurists, and law enforcement officers, a few outstanding individuals have en- 
couraged research into social problems and have introduced techniques and 
findings of modern science into their own work. Within psychology, there has 
been sporadic interest in legal problems during the past fifty years. Such early 
books as Burtt's Legal Psychology (16) and Robinson's Law and the Lawyers 
(116) illustrate a variety of ways in which psychologists may contribute to law.
Still other facets of the field are covered in more recent books on Psychology
for Law Enforcement Officers (32) and Legal and Criminal Psychology (138). 
Applied psychological research specifically concerned with legal problems is 
still meager. Teaching by psychologists in law schools is likewise limited to an 
occasional course here and there. In many different ways, however, psychol- 
ogists are beginning to establish contacts with the legal profession. In the 
following sections, the types of contributions that psychology can make to law 
will be illustrated in a variety of contexts, ranging from courtroom procedures 
and the treatment of individual delinquents and criminals to the formulation 
of laws and the development of national and international policies. 
1 This dilemma is reminiscent of what in statistics are known as Type I and
Type II errors.
TESTIMONY AND COURT PROCEDURE


Oddly enough, the psychology of testimony represents one of the earliest areas 
of applied psychology. Among the pioneers in this research was William Stern, 
who not only conducted studies of his own but also edited a journal on this 
topic, Beiträge zur Psychologie der Aussage, published at Leipzig during the
first decade of this century.² During the same period, Alfred Binet, of intelli-
gence test fame, was also conducting research and publishing on the psychol-
ogy of testimony (12). In America, Harvard professor Hugo Münsterberg
wrote On the Witness Stand (99) in 1908, a book summarizing relevant psy- 
chological studies. Between 1909 and 1917, the Psychological Bulletin 
published annual reviews on the psychology of testimony, prepared by 
G. M. Whipple. Research on problems of testimony continued until the early
1930s, after which activity lapsed for some twenty years. Since about 1955, 
however, interest in the topic has revived (35). The current approach is 
characterized by a broadened scope and by the collaboration of psychologists 
with members of other professions, notably sociology and law. 

Contributions of General Experimental Psychology. Much of what 
psychology can contribute to the evaluation of courtroom testimony is derived 
from established facts of sensation, perception, memory, and other familiar 
areas of experimental psychology (10; 16, chs. 2-4; 113). Among the most 
relevant findings are those pertaining to: localization of sounds; visual illusions, 
such as the overestimation of vertical as contrasted to horizontal distances, the 
overestimation of filled as contrasted to unfilled spaces, and the influence of 
contextual factors upon length estimation as illustrated by the Müller-Lyer
illusion; estimation of short time intervals and of the speed of moving vehicles; 
and adaptation phenomena, such as shifts between light-adapted and dark- 
adapted vision and adaptation effects in the perception of odors, temperature, 
and other continuing stimuli. In another area, studies of incidental memory 
suggest that more reliance can be placed upon the testimony of a witness who 
realized he might later have to report on the events he observed than on one 
who observed them casually and with no effort to memorize facts. Contrary to 
legal folklore, strong emotion at the time of observation or subsequent report 
tends to increase the probability of error. 

A recent example of the bearing that psychological research on perception 
may have upon the evaluation of legal testimony is provided by the case of a 
Canadian hunter who, mistaken for a deer, was shot and killed by his com- 
panions (130). The victim was wearing faded red coveralls and was seen just 
before sunset on a very overcast day. The first pertinent psychological obser- 
vation derives from the Purkinje phenomenon. As illumination diminishes, 
colors at the red end of the spectrum decrease in brightness faster than do 
those at the blue end. Consequently, red objects are among the first to lose 
their color as light fades. A second relevant psychological fact concerns the 
effect of set, or expectation, upon the perception of an ambiguous stimulus. 
The hunters, who were eagerly scanning the landscape for deer, perceived 
the moving object as a deer. When a policeman later observed a man under 
the same conditions, knowing it was a man, he perceived the object as a man. 
2 This journal was superseded in 1908 by the much broader Zeitschrift für angewandte
Psychologie, also edited by Stern. It is interesting to note that the first journal of
applied psychology evolved from a publication initially devoted to the psychology of
testimony.



Aussage Tests. First employed by William Stern, the Aussage test is now 
commonly known in the psychological literature by its original German name, 
meaning "testimony" or "report." In this test, a group of subjects are shown a 
picture or a short motion-picture film; or a brief and startling episode is en- 
acted before them (13, ch. 16; 131; 142). In the latter case, the subjects are 
not usually told in advance that they will be questioned about their observa- 
tions. For example, a stranger may burst into the classroom and engage in an 
altercation with the instructor; or an assistant may come rushing in and an- 
nounce that some white rats have escaped from the laboratory. Later, the 
subjects are asked to report on what they witnessed, giving details of appear- 
ance, dress, speech, and actions of the participants. 

By this procedure, accuracy of report may be investigated in relation to a 
number of variables, such as length of delay, prior instructions, suggestion, 
degree of emotional involvement, observer characteristics, and so on. One of 
the principal findings of this research is that errorless reports are rare. The 
usual reaction is to fill in the gaps in one's observation or recall of the scene 
with plausible details that are consistent with the individual's general per- 
ception of the event. The subject is often certain that he actually observed 
such a detail, although it was only suggested by the context. Under these con- 
ditions, moreover, agreement among several witnesses is no guarantee of fac- 
tual correctness, nor is it evidence of collusion. Such spurious consistency may 
result simply from constant errors that affect the responses of different persons. 
Thus another deep-rooted legal tradition regarding the validity of testimony is 
called into question. 

It is interesting to note that Stern (131) recommended the use of Aussage 
tests in the training of law students. By observing their own performance in 
the role of witness, prospective lawyers could thus gain a realistic understand-
ing of the limitations of courtroom testimony.

Method of Interrogation. Among the conditions commonly investigated 
by means of Aussage tests is the method of eliciting information from the wit- 
ness (10; 13, ch. 10; 16, chs. 5 and 6). Several early experiments consistently 
demonstrated that with free narration, in which the witness reports the inci- 
dent in his own words, accuracy is higher than with direct questioning. When 
asked specific questions, most persons give more details than in a free recital, 
but a larger percentage of their report is wrong. Under the more hostile or 
confusing type of questioning involved in cross-examination, accuracy tends to 
drop still further. 

Some data are also available regarding the form of the question (16, pp. 
119-123; 103). Leading questions that suggest a particular answer obviously 
tend to encourage false recall. Questions containing a hidden assumption like- 
wise have a strong suggestive effect. If asked whether the defendant held the 
gun in his right or left hand, a witness may honestly come to believe he saw 
a gun that was not actually there. Even the grammatical form of a question 
may exert an influence upon the response (16, ch. 5; 103). "Wasn't there a 
car parked in front of the bank?" is a more suggestive question than "Was 
there a car parked in front of the bank?" Directing attention to the witness 
himself as observer tends to increase response accuracy and caution. "Did you 
see a car?" is more likely to elicit a correct response than "Was there a car?"

It is also noteworthy that, when subjects are asked to indicate which facts 
they would be willing to report under oath, error is reduced but not eliminated 
(16, pp. 153-155). For most persons, an appreciable proportion of error re-
mains even among those facts they feel sufficiently sure about to affirm under
oath. 

Other Aspects of Courtroom Procedure. Psychological research on other 
aspects of courtroom procedure is so far quite limited (138, chs. 5 and 6). 
Some suggestive data have been gathered on variability in the sentences passed 
by different judges for similar crimes (10, pp. 203-204; 38; 39). In one study 
of six New Jersey judges, the median sentences for the same type of offenses 
varied from 6 to 12 months (39). The percentage of cases sentenced to the 
penitentiary by the same judges ranged from 34 to 57. Individual judges also 
differ in the relative severity of sentences they impose for specific crimes, such 
as sex crimes as compared with other types of crimes. 

Although theoretically a judge’s decision should be “impersonal,” it is in- 
evitable that the decisions actually reached are influenced by the individual 
judge’s personality, point of view, and experiential background. That such in- 
dividual variables may affect judicial decisions even at the highest level was 
demonstrated in a factorial analysis of decisions by justices of the United 
States Supreme Court (137). Based on the voting records of the nine judges 
over a two-year period, the study covered 115 cases in which there were at 
least two dissenting votes. The factor pattern showed that the nine judges 
fell into subgroups exhibiting significant and consistent differences in voting 
behavior. 

Somewhat more information is available on factors affecting the behavior 
of a jury (10; 16, ch. 7; 96; 138, ch. 5; 142). Jury members’ perception of the 
reported facts, as well as their ultimate decision in the case, may be influenced 
by any prejudices they may have toward racial, national, religious, economic, 
occupational, or other groups. Social stereotypes and snap judgments based on 
appearance or mannerisms may likewise affect the jury’s evaluation of defend- 
ant, witnesses, and other participants in the trial. 

Apart from such general considerations, applicable to all human interactions, 
some research has been specifically concerned with the American jury system 
(8; 60; 61; 96; 133; 134; 138, ch. 5). In much of this research, which was 
conducted jointly by psychologists, sociologists, and members of the legal 
profession, the subjects were jurors drawn by lot from regular jury pools. 
Either mock trials or recorded proceedings of actual trials were employed. The 
jury deliberations were in turn recorded and subsequently analyzed with re- 
gard to a number of factors, such as the effect of foreman personality upon 
jury decisions; sex, socioeconomic, and educational differences in relation to 
participation and influence; and the proportion of time devoted to different 
topics during jury deliberations. 

Other experiments have been conducted with groups of law school or col- 
lege students and have dealt with more limited questions. Several studies have 
investigated the effect of order of presentation upon persuasiveness of evidence 
(56, 57, 98, 142, 143). In one of these experiments (143), a detailed report
of a criminal case was divided into 13 sections and read to groups of pre-law 
students acting as jurors. At the end of each section, the subjects rated their 
belief in the defendant's innocence or guilt on a nine-point scale. Figure 126 
shows the median rating assigned by the group after each installment of evi- 
dence was heard. Installment 13 has been omitted from the graph, since it 
was merely the verdict of the actual trial jury. It can be seen that the judg-
ment of guilt rose with successive presentation of prosecution evidence and
dropped with successive presentation of defense evidence.

Fig. 126 Effect of order of presentation upon persuasiveness of evidence.
(Data from Weld and Roff, 143, pp. 619, 622.)

To test the effect of order of presentation, the same evidence was presented 
in a different order to each of two groups. When all the prosecution evidence 
was given first and the defense evidence second, the final median guilt rating 
was 2.3. When the series began and ended with prosecution items, with the 
defense items sandwiched in between, the final median guilt rating was 5.4. 
In the latter arrangement, the prosecution evidence had more influence be- 
cause of a recency effect. Other studies have demonstrated both primacy and 
recency effects (57, 98). Under ordinary courtroom conditions, primacy and 
recency effects tend to be equally strong (98). With different modes of pres-
entation and different time schedules, either one or the other effect may pre-
dominate (98).


DETECTION AND STRESSFUL INTERROGATION 


The preceding section was concerned with testimony given by a witness who 
is motivated to be accurate and truthful. In many types of interrogation, how- 
ever, the respondent tries to conceal or disguise facts or to mislead the inves- 
tigator in other ways. The procedures discussed in this section share the com- 
mon objective of obtaining correct information under the latter conditions. 

Development of Lie Detection. Like the psychology of testimony, psy- 
chological research on lie detection dates from the beginning of this century. 
Among its pioneers were Benussi (7), Marston (93, 94), Larson (75, 76), 
Keeler (see 78, 79), and Summers (135, 136). Current procedures of lie detec- 
tion represent a special application of two techniques long familiar in the 
psychological laboratory, namely, free association and the measurement of
physiological indices of emotion. 

Basically, all lie detectors compare the subject's responses to critical and neu- 
tral stimuli. The critical stimuli are associated with the specific crime under 
investigation. Suppose the crime was the theft of a green purse containing, 
among other things, $34 in cash, a visiting card with the name Henry Dono- 
hue, and a book of matches from the Morningside Coffee Shop. In that case, 
the words “green,” "34," “Henry Donohue,” and “Morningside” could be 
used as critical stimuli in a free association test. Neutral stimuli could be 
chosen so as to match the critical stimuli as closely as possible. Thus among 
the neutral words could be those for other colors, numbers, and proper names, 
such as “blue,” “yellow,” “28,” “47,” “John Edwards,” “Herbert Reilly," "Ever- 
glade,” and so forth. The critical words would of course be scattered at random 
among the neutral words. 

If the subject's responses to the critical words are significantly different from
his responses to the neutral words, in terms of speed, failure to respond, repeti-
tion of stimulus word, repetition of an earlier response, or other features,
guilty knowledge is indicated. To the innocent subject, critical and neutral 
words are indistinguishable. This procedure is not applicable when all details 
of the crime have been widely publicized. It is based on the assumption that 
at least some details can be found that are known only to the guilty party. 
Many variants of the technique have been developed. Instead of stimulus 
words for free association, the investigator may use a series of questions, to be 
answered “Yes” or “No,” or a set of multiple-choice questions. For instance, a 
list of addresses may be given at only one of which a burglary was committed. 

Physiological measures of emotional excitement may be employed along 
with any of the verbal techniques described above. Any indices of emotion 
could be used, but the most common are changes in respiration, blood pres- 
sure, and galvanic skin responses (see 24, 78, 79). In the galvanic skin re- 
sponse (GSR), increased secretion of the sweat glands is detected by a rise 
in the electrical conductivity of the skin. When physiological measures are 
employed, the criterion of lying or guilt is again a consistent difference in re- 
sponse to critical and neutral items. Several physiological measures are often 
combined into a single recording instrument, known as a “polygraph.” This 
term, which literally means “multiple graphic record,” is commonly used as a 
generic name for lie detectors. 
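
The comparison of responses to critical and neutral items described above can be sketched in a few lines of Python. This is only an illustrative sketch, not the procedure of any investigator cited here: the function names, the rule of counting an item as a hit when the critical alternative draws the single largest response, and the sample readings are all assumptions introduced for the example.

```python
from typing import Sequence


def count_critical_hits(items: Sequence[Sequence[float]],
                        critical_index: Sequence[int]) -> int:
    """Count items on which the critical alternative drew the single largest response.

    `items` holds one list of response magnitudes per item (for example,
    hypothetical GSR amplitudes); `critical_index` gives the position of the
    critical alternative within each list. Both names are illustrative.
    """
    hits = 0
    for responses, crit in zip(items, critical_index):
        largest = max(responses)
        # A tie for the largest response is not counted as a hit.
        if responses[crit] == largest and list(responses).count(largest) == 1:
            hits += 1
    return hits


def expected_hits_by_chance(items: Sequence[Sequence[float]]) -> float:
    """Expected hits if critical and neutral alternatives were indistinguishable."""
    return sum(1.0 / len(responses) for responses in items)


# Hypothetical readings: three items with four alternatives each.
readings = [
    [0.2, 0.9, 0.3, 0.1],
    [0.5, 0.4, 0.8, 0.2],
    [0.3, 0.2, 0.1, 0.6],
]
critical = [1, 2, 0]

hits = count_critical_hits(readings, critical)
chance = expected_hits_by_chance(readings)
print(f"hits = {hits}, expected by chance = {chance:.2f}")
```

A hit count well above the chance figure would correspond to the consistent difference in response that serves as the criterion; how large a margin should be required is a statistical question that this sketch does not settle.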

Practical Applications of Lie Detectors. Today lie detectors are used 
widely for a number of purposes (78, 79, 144). As early as 1953, it was esti- 
mated that approximately one hundred police departments were using such 
tests (144). In police work, lie detectors are employed chiefly in the prelimi- 
nary examination and screening of suspects. Outstanding examples of their 
long-continued and effective application are provided by the Police Depart- 
ment of Berkeley, California, where the technique was introduced by Larson, 
and by the Chicago Police Scientific Crime Detection Laboratory, where it 
was introduced by Keeler. In addition, lie detectors are used regularly by sev- 
eral government agencies, such as the Atomic Energy Commission, and by 
all branches of the military services. They are also finding extensive applica- 
tion in banks, hotels, retail stores, and industrial plants, both in personnel se-
lection and in periodic examination of employees.

Surveys of the correctness of the conclusions reached by polygraphic tech-
niques have generally yielded encouraging results (144). Estimates of the per-
formance of well-trained and experienced examiners usually run to about 75
to 80 per cent correct decisions, 15 to 20 per cent uncertain, and 2 to 5 per
cent incorrect decisions. Some highly experienced practitioners claim close to 
100 per cent accuracy for the records that they can interpret with confidence, 
after excluding doubtful or ambiguous cases. Because of the nature of the 
technique, moreover, the errors are usually failures to detect guilt; rarely is an 
innocent person shown to be guilty by the lie detector. The high optimism 
engendered by these surveys must be tempered by a consideration of the un- 
systematic and uncontrolled nature of the data on which they are based. 
Field studies of lie detection techniques in practical use need to be checked 
by controlled experimentation, of which there is still a dearth. 

So far lie detection data have only rarely been admitted as evidence in court 
trials (144). Occasionally such data are admitted with the prior consent of 
both parties in the litigation. In such cases, both parties agree to submit to a 
lie detection test administered by an examiner who is acceptable to both and 
whose remuneration is shared equally by them. The findings of the test are 
then presented by the examiner as expert testimony and are considered along 
with other evidence by the judge or jury. 

Whether in a legal setting or elsewhere, a lie detector should be employed 
only with the examinee's consent and with his knowledge of what the pro- 
cedure is designed to accomplish. The ethical question of invasion of privacy 
is relevant in this connection. With the increasing professionalization of poly- 
graph examiners, more attention is being given to the rights of the subject, 
necessary precautions in the use of lie detectors, and other matters of profes-
sional ethics (78). There is also growing concern with questions of training,
standards, and certification. Training programs in lie detection techniques are 
conducted by universities, police departments, the military services, and pri- 
vate organizations. An important step in the improvement of lie detection 
practices was the establishment in 1949 of the Academy for Scientific Interro- 
gation, a professional organization of leading practitioners from police depart- 
ments, the Armed Forces, and other settings (78, vol. 2). 

It should be noted that, while some psychologists are engaged in lie detec- 
tion work, the majority of practitioners in this field are nonpsychologists. The 
amount of psychological training that these practitioners have received varies 
widely. There is an obvious need for the closer involvement of psychologists 
in consultation, training, and research on lie detection.

Research on Lie Detection. The recent renewal of interest in all aspects 
of interrogation has been accompanied by a resurgence of research on lie de-
tection (see, e.g., 24, 33, 70, 84, 85). To obtain better control of conditions,
experiments are typically conducted with "simulated crimes," although every 
effort is made to achieve realism. For example, in a recent study by Kubis (70), 
conducted under Air Force sponsorship, volunteer college students participated 
in a simulated theft experiment. The experimental crime consisted in taking 
money from a coinbox attached to a pamphlet rack. The money was eventu- 
ally returned, but the subjects did not learn about this condition until the 
end of the experiment. 

In this experiment, subjects were grouped into sets of three, one of whom
served as “thief,” one as “lookout,” and the third as “innocent suspect.” The
importance of avoiding detection at all stages was impressed upon the partici-
pants. Each subject was examined by several standardized verbal procedures,
while his physiological responses were recorded. Figure 127 shows a subject 
with attached apparatus, including chest pneumograph for obtaining breath-
ing records, plethysmograph cuff inserted over middle finger of right hand
for blood pressure measurement, and palmar electrodes strapped to each hand 
for recording galvanic skin response. All examiners who conducted the lie 
detection tests and analyzed records were in ignorance of the roles assigned 
to individual subjects. 

One of the objects of this study was to try to objectify the evaluation of 
records with a view to their ultimate handling by a computer. In this connec-
tion, a promising finding was that analysts who worked only from the records
did as well as the examiners who had contact with the subjects during the 
test. For all experimenters, the percentage of correct identifications ranged from 
73 to 92, with no uncertain category permitted. All these percentages exceeded
the chance expectancy of 33 per cent correct by large and highly significant
amounts. This study also explored ways of combining the three physiological
measures statistically to reach a decision on each case. It was concluded, how-
ever, that the interpretation of lie detector records is not yet sufficiently objec-
tive to permit this type of treatment.

Fig. 127 Lie detection apparatus for measuring changes in breathing, blood
pressure, and galvanic skin response. (Courtesy of Joseph F. Kubis; see 70.)
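
The comparison with the chance expectancy of 33 per cent can be made concrete with a binomial calculation. The sketch below is an illustration under stated assumptions: the number of cases judged is not given in the text, so the figure of 30 cases (22 judged correctly, about 73 per cent) is hypothetical, as are the function and variable names.

```python
from math import comb


def binomial_upper_tail(n: int, k: int, p: float) -> float:
    """Probability of k or more successes in n independent trials with success rate p."""
    return sum(comb(n, i) * p**i * (1 - p)**(n - i) for i in range(k, n + 1))


# Hypothetical figures: 30 cases, 22 identified correctly, chance rate of one in three.
n_cases = 30
n_correct = 22
chance_rate = 1 / 3

p_value = binomial_upper_tail(n_cases, n_correct, chance_rate)
print(f"P(at least {n_correct} of {n_cases} correct by chance) = {p_value:.2e}")
```

A very small tail probability under these assumed figures is what would be meant by exceeding chance by a highly significant amount; with fewer cases the same percentage would carry less weight.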

Further research on lie detection is needed along several lines. Other indices 
of emotional excitement should be explored and ways of improving the indices 
now in use investigated. Some progress has already been made in the refine- 
ment of the physiological measures employed (24; 33; 78, vol. 1, ch. 7, and vol. 
2, ch. 8). For example, the use of a finger plethysmograph rather than the 
traditional arm cuff for measuring blood pressure permits continuous and pro-
longed recordings without discomfort or injury to the subject (70; 78, vol. 2,
ch. 8). Other refinements of technique may decrease the frequency of ambigu- 
ous records and increase the objectivity of interpretation. 

More research is also needed on ways of combining the results of different 
physiological measures. In this connection, the findings on autonomic response 
stereotypy cited in Chapter 15 are quite relevant (24, 73). These findings
suggest that one person may show his maximal response to stress in increased 
hand sweating, another in respiratory changes, and so on. Under these condi- 
tions, the use of multiple measures (as in the typical polygraph) would seem 
desirable. Moreover, the weights of individual measures would vary with the 
individual. 
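
One way to picture measures whose weights vary with the individual is to express each subject's reaction on every channel relative to his own neutral-item baseline and then see which channel responds most strongly. The following is a rough sketch only; the channel names, the sample values, and the choice of a standardized shift as the index are assumptions made for illustration and are not drawn from the studies cited.

```python
from statistics import mean, stdev
from typing import Dict, List, Tuple


def standardized_shift(neutral: List[float], critical: List[float]) -> float:
    """Mean critical response minus mean neutral response, in units of the neutral spread."""
    return (mean(critical) - mean(neutral)) / stdev(neutral)


def strongest_channel(
    record: Dict[str, Dict[str, List[float]]]
) -> Tuple[str, Dict[str, float]]:
    """Return the channel with the largest standardized shift, plus all shifts."""
    shifts = {
        name: standardized_shift(channel["neutral"], channel["critical"])
        for name, channel in record.items()
    }
    best = max(shifts, key=lambda name: shifts[name])
    return best, shifts


# Hypothetical record for one subject on two channels.
subject_record = {
    "gsr": {"neutral": [1.0, 1.2, 0.8, 1.0], "critical": [2.0, 2.2]},
    "respiration": {"neutral": [10.0, 11.0, 9.0, 10.0], "critical": [10.0, 10.5]},
}

best_channel, all_shifts = strongest_channel(subject_record)
print(best_channel, all_shifts)
```

In this hypothetical record the subject reacts mainly in the skin response, so that channel would receive the greatest weight for him; another subject's record might single out respiration instead.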

One of the unanswered questions in lie detection pertains to the mechanism 
underlying the physiological changes and other response disturbances that 
occur when critical stimuli are presented (24, 84, 85). Although there has 
been some theoretical speculation about the nature of this mechanism, it is 
as yet little understood. Some controversy centers around the extent to which 
lying as such can be detected, as contrasted to the possession of guilty knowl- 
edge. The example cited in the opening of this section was based upon the 
identification of guilty knowledge. When critical and neutral items are in- 
distinguishable to innocent respondents, consistent differentiation of responses 
to critical items obviously reveals guilty knowledge. Under these conditions, 
we would expect the maximum success with polygraphic techniques; and em-
pirical findings do in fact support this expectation (84).

A variant of the guilty knowledge test was employed by Lykken (85) with 
dramatic success. In this experiment, the object was to identify each of 20 
subjects by examining their galvanic skin responses to a set of 25 questions 
pertaining to such details as mother’s first name, name of high school attended, 
street on which subject lived as a child, etc. Multiple-choice responses were 
presented for each item, in which one answer was correct for the given subject. 
Despite the fact that the subjects were highly motivated to mislead the experi- 
menter, correct identification occurred in 100 per cent of the cases! For all 
subjects, the pattern of galvanic skin responses revealed the differential reac- 
tion elicited by the familiar alternative in each set of item responses. 

Many of the practical applications of polygraphic techniques, however, are
based, not on the detection of guilty knowledge, but on the detection of lying
itself. Similarly, in the previously cited research by Kubis (70), the
verbal procedures investigated were concerned with lying. For example, both
guilty and innocent subjects were asked such critical questions as "Did you
take the money from the coinbox?" These questions were interspersed with
emotionally neutral questions ("Are you a college student?" "Do you have
afternoon classes?") and emotionally toned questions unrelated to the crime
(“Were you ever operated on for appendicitis?” "Is your draft status 1A?"). In
this setup, the critical questions are easily recognized as such by innocent and 
guilty subjects alike, especially since the innocent subjects had been told that 
a theft of money from a coinbox was under investigation. The fact that anal- 
ysis of records led to significantly more correct judgments than expected by 
chance suggests that lying as such can be detected, at least under some cir- 
cumstances. 

A final question on which more research is needed concerns the effective- 
ness of countermeasures that the subject may employ to escape detection. 
Results obtained to date are somewhat conflicting (70, 85). Although naive
persons would probably find it very difficult to "beat the test," with fuller 
understanding of lie detector principles and special training in countermeasures 
it may be possible for some individuals to control their physiological responses 
sufficiently to mislead the examiner. Such findings in turn may stimulate the 
development of more nearly “unbeatable” techniques. 

Stressful Interrogation. Unlike lie detectors, traditional procedures of in- 
terrogation in both civilian and military practice have relied upon some form 
of stress to elicit facts from an uncooperative witness. Both the kind and in-
tensity of stress vary widely in different situations, as illustrated by cross-
examination in courtroom procedure, the examination of witnesses by congres-
sional investigating committees (35), "third-degree" interrogation in police
investigations (79), and the questioning of prisoners of war (6, 11). In an 
extreme form, these stress techniques were illustrated by the so-called “brain- 
washing” of war prisoners by Chinese Communists during the Korean conflict. 
The brainwashing techniques employed to obtain false confessions and ideo-
logical conversions were similar to those used to elicit secret military informa-
tion.

Stressful interrogation in any form presents many serious questions of ethics 
and legality. What the psychologist can contribute to the situation is some 
clarification of the nature of the stresses involved and their effects upon the 
individual. Together with psychiatrists, physiologists, and other biological sci- 
entists, psychologists have analyzed the problems of stressful interrogation in 
the light of available scientific knowledge (6, 11, 34, 48). To a limited extent, 
they have also investigated these problems more directly, as, for example, 
through interviewing and testing returned prisoners of war (see 6). 

Although knowledge in this area is quite limited and many questions re- 
main unanswered, there is general agreement on certain points. First, the tech- 
niques of brainwashing employed in the Korean conflict were not so novel, 
scientific, or mysterious as suggested by popular accounts. There is no evidence 
that these techniques were based on the systematic application of any specific 
psychological principles. The methods used did not differ fundamentally from 
those employed many centuries ago for coercion and persuasion. Debility, in- 
tellectual disorganization, and emotional disturbances can be induced by 
intense physical and psychological stresses, such as semistarvation, lack of 
sleep, illness, physical abuse, isolation, and sensory deprivation. Threats and 
acute physical discomfort eventually produce compliance in some persons.

Individuals differ in their responses to all these conditions. An important 
objective of psychological research was to identify the significant personal and 
background variables associated with such response differences, but the available
data are meager. It does seem clear, however, that the disruptive effects
of stress are augmented by suggestion and fear of the unknown. If the indi- 
vidual believes the interrogator has mysterious and powerful techniques for 
manipulating behavior, his resistance is further reduced. Understanding provides
some measure of protection against these indirect effects of suggestion.

Still another point on which there is general agreement is that stress tech- 
niques may be so disruptive as to render the respondent incapable of accurate 
reporting. Drugs, excessive fatigue, severe nutritional deficiencies, infection, 
and other bodily disturbances may adversely affect cerebral functioning and 
thereby interfere with normal thought processes. Any drug that weakens the 
respondent's ability to withhold information, for example, is also likely to 
impair his ability to impart information. A victim of hallucinations is hardly 
a reliable informant. Even milder forms of stress, such as strong emotional 
excitement, may produce some confusion, loss of memory, and other thought 
disturbances. In these disruptive effects lies the major psychological weakness 
of stressful interrogation. It should be borne in mind, however, that this limita- 
tion applies chiefly to the use of stressful interrogation for fact-finding purposes 
rather than for propaganda, ideological conversions, false confessions, and 
other purposes for which it was employed in the Korean conflict. 


PSYCHOLOGICAL FACTORS IN 
DELINQUENCY AND CRIME 


For nearly a century, anthropologists, sociologists, psychiatrists, psychologists, 
and other specialists have been studying the characteristics of delinquents and 
criminals in the effort to identify the causes of antisocial behavior (see 83; 
138, chs. 8 and 9). Among the first to pursue such investigations was Lombroso
(80, 81), a nineteenth-century Italian anthropologist who proposed that
the criminal was a throwback to an earlier, more primitive stage of human 
evolution. In support of this doctrine, he gathered extensive data on physical 
anomalies, or "stigmata of degeneracy," that he observed in prison inmates.
Examples include prognathous jaw, receding chin, low forehead, flattened
nose, facial asymmetry, deformities of teeth and palate, and malformations
of the ears. This theory of a hereditary "criminal type" enjoyed considerable
popularity until control observations revealed equal incidence of these physical
defects among comparable non-criminal populations.

After the turn of the century, the emphasis shifted to mental deficiency as 
the major cause of crime (46, 47, 83, 128). This hypothesis was bolstered by 
investigations of such families as the Jukes and Kallikaks, which showed a 
high concentration of mental deficiency along with crime, immorality, alcoholism,
pauperism, and other social problems over many generations. Early
interpretations of these familial studies stressed poor heredity, failing to take
into account the extremely poor home and community environments in which
each successive generation of children had been reared.

That intellectual defect cannot account for a large proportion of crimes 
was demonstrated by two extensive surveys of the intelligence test performance
of men in state penitentiaries (100, 139). Conducted shortly after World
War I, both studies found the distribution of Army Alpha scores of the penitentiary
samples to be virtually identical with that of the total Army draft from
the same states. As might be anticipated, there was a significant relation
between intelligence and type of crime. Individuals committed for fraud ob- 
tained the highest mean scores, while those committed for sex crimes obtained 
the lowest. The proportion of all crimes that could be attributed to mental 
deficiency, however, was quite small. 

Surveys of juvenile delinquents reveal a somewhat greater incidence of low 
intelligence. The proportion of mentally defective and borderline cases is 
appreciably larger than that in the general population, and the mean IQs 
center around 80. The deficiency is particularly apparent on verbal tests, which
are highly correlated with school achievement. Several reasons have been sug- 
gested for the divergent findings regarding the intelligence of adult criminals 
and juvenile delinquents. The intellectually subnormal delinquent is more 
likely to be caught early in life, while the brighter delinquent escapes detec- 
tion longer. When apprehended, mentally defective delinquents are usually 
sent to institutions for mental defectives, where they have no further oppor- 
tunity to come into conflict with the law. Furthermore, the types of crimes 
committed by most juveniles are those that predominate among persons of 
low mental level, including physical assault, destruction of property, sex 
crimes, and a variety of minor offenses. 

The relation between the juvenile delinquent's typical hostility toward 
school and his intellectual retardation is noteworthy. The intellectually back- 
ward child naturally experiences more failure and frustration in school. Such 
frustration tends to engender aggressive behavior. At the same time, the delin- 
quent’s antagonism toward authority and restrictions makes him a poor school 
learner and may in turn account for his retarded intellectual development. 
This is an example of the difficulty of unraveling cause-effect relations in de- 
linquent careers. 

Like intellectual defect, other pathological conditions undoubtedly account 
for some crimes, but their proportional contribution is small. A few crimes are 
committed by psychotics, as illustrated by the paranoid schizophrenic who 
shoots several strangers under the delusion that they are plotting his destruction. 
Similarly, the kleptomaniac, who steals articles for which he has no use, and 
the pyromaniac, who sets fires at random, are extreme examples of compul- 
sive behavior. The psychopathic offender, who is impervious to socialization 
and unconcerned about the consequences of his actions, has been the subject 
of much controversy and is as yet little understood (138, ch. 12). The roles 
of sex perversions, drug addiction, and alcoholism in criminal behavior have 
been widely discussed (see, e.g., 32, chs. 9, 12, 13; 138, chs. 15-17). Certain
forms of brain damage may lead to a characteristic syndrome of impulsivity and 
overactivity, as illustrated by delinquent children with a history of encephalitis
lethargica. Although psychiatrists have written more extensively about these 
special types of criminal behavior, psychologists have been making increasing 
contributions through clinical studies and research. 

Much of the research on causes and treatment of crime in general has been 
conducted by sociologists, although some of these investigations have dealt 
with distinctly psychological variables. An outstanding example is the continuing
series of studies by the Gluecks (42-45), spanning a period of over
thirty years. One part of this project involved periodic follow-ups of 599 
youthful offenders committed to the Massachusetts Reformatory. Data were 
gathered retrospectively regarding their careers prior to and during incarceration,
as well as directly through repeated follow-ups 5, 10, and 15 years after
discharge. Among the objectives of this study was a comparison of the effective- 
ness of different types of penocorrectional treatments, such as probation and 
parole (43). 

A major finding of the Glueck research pertains to the early home and 
community backgrounds of the offenders, which were characterized by many 
adverse social and psychological conditions. Parental education and socioeco- 
nomic level were typically low; economic dependency, overcrowding, and in- 
sanitary living conditions were common; and broken homes, parental neglect, 
and criminal influences were prevalent. These results are typical of those ob- 
tained in other studies of the early environments of delinquents. While recog- 
nizing the influence of such broad environmental factors, the Gluecks in their 
later work have tended to emphasize the individual's own personality traits 
as a major determinant of his behavior both before and after correctional 
treatment. An important influence contributing to the development of these 
traits is the nature of parent-child relations during early childhood. The role
of these interpersonal factors was brought out in a later study by the Gluecks, 
in which 500 delinquent boys were compared with a non-delinquent group
matched with them in age, general intelligence, national and ethnic origin, 
and residence in underprivileged neighborhoods (44). The investigators hoped 
thereby to help explain why some children turn to delinquency while others 
remain law-abiding even when reared in the same neighborhoods. 

An even more direct approach to this question is provided by an earlier 
study by Healy and Bronner (52), in which extensive case-history data were 
gathered on 105 juvenile delinquents and on their non-delinquent siblings. 
The gross environmental variables characterizing the socioeconomic, cultural, 
and educational level of the family were thus automatically held constant. 
In comparison with their non-delinquent siblings, many more delinquent chil- 
dren showed strong dislike for school and for individual teachers. The delin- 
quents manifested more discontent and emotional disturbance and more often 
reported feelings of rejection, insecurity, and inferiority. When non-delin- 
quents experienced comparable emotional difficulties, there were usually coun- 
terbalancing satisfactions that made their situations more tolerable. 

A limitation of most delinquency studies, even longitudinal studies such as 
those of the Gluecks, is that observations begin after the subjects are identified 
as delinquents. Retrospective studies of childhood experiences are always sub- 
ject to error, even when skilled interviewers and caseworkers gather the data. 
Moreover, it is difficult to disentangle causal relations under these circum- 
stances. If delinquents are backward in school, hostile toward authority, 
emotionally insecure, and so on, we cannot determine to what extent these 
reactions may result from the delinquent behavior itself, as well as from the 
experience of arrest and institutional commitment. One way to avoid this dif- 
ficulty is to begin a longitudinal study before delinquent behavior occurs. This 
requires the gathering of data on a large sample of children, who are then 
followed up to see who becomes delinquent and who does not.

Fig. 128 Mean scores on Haggerty-Olson-Wickman Behavior Rating Schedules
in relation to delinquency record within a six-year follow-up. (From Olson,
107, p. 286.)

A few psychologists have followed this approach by surveying large samples
of elementary and high school pupils by means of personality tests and rating
scales, such as the Minnesota Multiphasic Personality Inventory (51; 138, ch.
16; 145), Haggerty-Olson-Wickman Behavior Rating Schedules (107, pp. 284-289),
and other self-report inventories and rating scales (3). Similarly, first-grade
public school boys in high-delinquency areas in New York City and
Washington, D.C., were evaluated by use of one of the Glueck prediction 
tables, based on parent-child relationships (15, ch. 21; 41, pp. 1023-1051; 138, 
ch. 10). Despite the initial examination of large groups, the number of delin- 
quents studied in these samples is too small to permit conclusive generaliza- 
tions. Nevertheless all provide evidence that delinquency can be predicted 
from prior behavior, even as early as the first grade. Figure 128 illustrates re- 
sults obtained with the Haggerty-Olson-Wickman Behavior Rating Schedules, 
used by teachers to report and rate the occurrence of behavior problems among 
schoolchildren (107, pp. 284-289). These scales were applied to approximately 
3,000 children, about 1,500 of whom were in the first grade, while the 
remainder were enrolled in the eighth grade, junior high school, or special 
classes for the mentally retarded. After a lapse of six years, the names of
the first- and eighth-grade children were checked against probation office
and court records. The original mean problem-tendency scores of delinquents
thus identified were compared with total group norms, with the results 
shown in Figure 128. Not only had the delinquents as a group manifested 
more behavior problems than did the children in general, but the mean 
problem-tendency score also rose with the severity of the offenses subsequently 
committed. 

Another attempt to isolate causal factors is represented by a study of the 
child-rearing attitudes of mothers of delinquent and non-delinquent adolescents 
(92). Significant differences in the expected direction were found, although it 
is possible that some of these differences may represent parental reaction to 
earlier delinquent behavior on the part of the child. Mention should also be 
made of the Kvaraceus Delinquency Proneness Scale (72), which utilizes 
many items found to differentiate delinquent from non-delinquent groups, in 
the effort to detect tendencies toward delinquency in individuals. Under the 
innocuous title of KD Proneness Scale, this instrument presents 75 multiple- 
choice questions on preferences and attitudes to be filled out by the child. 
The same author prepared the KD Proneness Check List for use by teachers 
and other professional workers in rating a child with regard to both personal 
and environmental factors associated with delinquency (72). 

In summary, it should be noted that current research on the causes of 
delinquency and crime has put increasing emphasis upon the individual's 
emotional, attitudinal, and motivational characteristics, as derived at least in 
part from childhood experiences. The most important single finding, however, 
is that crime and delinquency have, not one, but many causes. Causal factors 
vary among individual delinquents and criminals, a fact that strongly supports 
differential individual treatment. Even within a single individual, however, 
there is no single cause for antisocial behavior, but rather a pattern of inter- 
acting factors. 


THE PSYCHOLOGIST IN CORRECTIONAL SERVICES 


At every stage in the treatment of delinquents and criminals, there is need 
for psychological services. Yet the utilization of such services has proceeded 
very slowly, and the number of psychologists so employed is still far below 
estimated requirements. The psychologist working in a correctional setting is 
typically a clinical or counseling psychologist with special experience in deal- 
ing with problems of delinquency and crime. In most situations, he functions 
as a member of a team which includes a psychiatrist and a social worker. In 
order to make effective recommendations, he needs to be familiar with penal 
institutions, social agencies, and other community facilities.

Psychologists may be employed on a full-time or part-time basis by courts,
probation departments, parole boards, penitentiaries, reformatories, training 
schools for juvenile delinquents, or other penal and correctional institutions. 
They may also be called upon as consultants on individual cases when no 
psychological services are regularly available. In all these contexts, the prin- 
cipal activities of the psychologist consist in the application of the various clin- 
ical and counseling techniques discussed in Chapters 13 to 17. While the focus 
is still predominantly on diagnostic functions, therapeutic and research activities
are slowly being introduced. Much of the work of psychologists in correctional
settings is of an advisory nature. Thus the psychologist may provide
facts and evaluations to aid a judge or a parole board in the disposition of an
individual case. Or he may recommend a treatment program to be carried out
by a social worker, teacher, probation officer, or other professional personnel.
Although the functions of psychologists in different correctional settings have 
much in common, we shall examine the major features of psychological serv- 
ices in court systems, adult prisons, and institutions for juvenile delinquents. 

Court Psychologist. Most court systems today have access to consultants 
in psychology and other relevant specialties, such as psychiatry, social work, 
and education (32, ch. 14; 138, ch. 11). The principal function of such special- 
ists is to help in the understanding of the offender, so that the disposition of 
the case can be adapted to the characteristics of the individual as well as to 
the nature of the crime. Referrals for psychological evaluations of children and 
adults may be made prior to trial or prior to sentencing. Juvenile courts in 
large cities usually have their own behavior clinic, whose staff carries on these 
functions. Such clinics are also attached to some domestic relations courts and 
to a few criminal courts. 

Like most clinical psychologists, court psychologists began as mental testers. 
Early court psychologists did little more than administer the Stanford-Binet 
or other intelligence tests to find the individual's “IQ.” Slowly their functions 
have been broadening to cover a comprehensive diagnostic evaluation of the 
individual through a variety of testing and interviewing techniques. By these 
procedures, the court psychologist tries to identify the mentally defective, psy- 
chotic, brain-damaged, neurotic, or other pathological offender. He provides 
information regarding personal characteristics and background that helps to 
clarify the individual offender’s motivations and attitudes and facilitates the 
prediction of his subsequent behavior. He also evaluates the individual’s poten- 
tial for education, vocational training, and job placement. 

On the basis of his findings, the court psychologist may also make recom- 
mendations regarding disposition and treatment. In some court clinics, the 
psychologist does a limited amount of counseling and short-term psycho- 
therapy. With children, the court clinic can serve a preventive function by 
prescribing remedial and therapeutic programs. Thus the youthful offender 
may be salvaged at an early stage without formal adjudication as a delinquent 
or institutional commitment. As in all clinical work with children, interview- 
ing and counseling of parents and other significant adult associates are an 
integral part of the process. Remedial instruction, readjustments in school
placement, recreational programs, foster home placement, and other types of 
milieu therapy may be arranged upon the psychologist’s recommendation.

Prison Psychologist. Although the first organized psychological service
in an American prison was established as early as 1919 (58), the number of 
psychologists so employed has been increasing at an incredibly slow rate. 
Recent estimates place the number of prison psychologists at about one for 
every three thousand prisoners (23; 32, p. 358). Moreover, the psychological 
training of those who do hold these positions is often quite inadequate. 
Among the major factors that have discouraged well-qualified clinical psychologists
from working in such settings are poor cooperation from other institutional
personnel and the difficulties of conducting psychological services in
institutions not fundamentally oriented toward rehabilitation. Although there
are outstanding exceptions, correctional institutions have been very slow in 
accepting a therapeutic point of view (23; 32, ch. 14; 138, ch. 13). 

One of the chief functions of the prison psychologist is the evaluation of 
new admissions for purposes of classification. Diagnostic testing and interview- 
ing for the assessment of intellectual and personality characteristics form a 
regular part of intake procedures. Since midcentury, however, there has been 
an increasing trend toward having such assessments conducted at a diagnostic 
center to which prisoners are sent prior to institutional assignment. Thus the 
choice of institution can be based in part upon the results of the diagnostic 
evaluation. A major purpose of classification is the assignment of the individual
to appropriate work functions, educational programs, or vocational training
within the institution. For this reason, the testing of adult prisoners puts
considerable emphasis upon the measurement of vocational aptitudes, skills, and 
interests. In connection with the evaluation of parole candidates, the psycholo- 
gist again conducts a comprehensive assessment of individuals with special 
reference to their readiness for community living and the prediction of sub- 
sequent behavior. 

Prison psychologists also help in identifying individuals with severe psycho- 
pathology, who may need referral to special institutions, as well as those with 
milder disorders, who may be treated locally. Insofar as their work load per- 
mits it, prison psychologists carry on short-term individual or group psycho- 
therapy with prisoners. Counseling is more generally practiced, with particular 
reference to vocational planning and to preparation for a return to the com- 
munity, with its many adjustment problems. For the latter purpose, prison 
psychologists often employ group techniques, which may range from ordinary 
discussion meetings to psychodrama and other therapeutic methods. Psychologists
may also participate in the planning and development of training programs
for prisoners. In this connection reference may be made to the teaching
machine project initiated at the Draper Correctional Center, which was cited
in Chapter 18 (89).

Persons committed to correctional institutions represent an excellent popula- 
tion for basic research on deviant behavior, as well as on other psychological 
problems. There are few other situations providing such large samples of 
adults whose time can be made available for research under uniform living
conditions. Nevertheless, little basic psychological research has so far been conducted
in these settings. Most of the research activities of psychologists have
been limited to applied research designed to evaluate the predictive validity of
diagnostic techniques or the effectiveness of rehabilitation programs. For
these purposes, follow-up studies are conducted in which psychologists usually 
collaborate with psychiatrists and social workers. 
The Psychologist in the Training School. Although the need for psychological
services has been more widely recognized in institutions for juvenile
delinquents than in institutions for adult prisoners, development of these
services, too, has been slow. In a 1953 survey of public training schools for
delinquent boys, only about one-half reported having one or more
psychologists on their staff; one-fifth employed no social workers (123).
Training standards were again low, and there were many unfilled positions.

The functions of the psychologist in the training school for juvenile delinquents
are very similar to those of psychologists working in institutions for
emotionally disturbed children (67; 138, ch. 14). In contrast to the work with
adult prisoners, more emphasis is placed upon the diagnosis of personality 
difficulties. Although vocational training and counseling are an integral part 
of the rehabilitation program, more attention is given to psychotherapy than in 
adult programs. Play therapy and art therapy are especially applicable in such 
settings. Remedial educational programs are particularly important because of 
the juvenile delinquent’s educational retardation and poor school adjustment. 
The psychologist may also make recommendations regarding milieu therapy, 
both with regard to total institutional organization and with regard to treat- 
ment of individual cases. 


THE PSYCHOLOGIST AS EXPERT WITNESS 


As long ago as 1911, Judge Learned Hand wrote in one of his decisions: 


How long we shall continue to blunder along without the aid of unpartisan and 
authoritative scientific assistance in the administration of justice, no one knows; but 
all fair persons not conventionalized by provincial legal habits of mind ought, I 
should think, unite to effect some such advance (cited in 40, p. 685).


One of the means whereby scientific assistance may be made available to judges 
and juries is through the testimony of expert witnesses. In law, an expert in a 
given field is one who by virtue of his training and experience has "the power 
to draw inferences from the facts which a jury would not be competent to 
draw” (82, part I, p. 244). 

Psychology is one of the most recent sciences to be recognized by the courts 
as providing such special training and experience. To be sure, individual 
psychologists have testified as experts since early in this century. This was
particularly true in Germany, where such eminent psychologists as Karl 
Marbe, William Stern, and Kurt Bondy were called upon to testify on a 
variety of questions for which their training in experimental psychology 
qualified them (see 86, 131). Nevertheless, it is only since 1950 that psy- 
chologists have begun to make an appreciable contribution in this role. Within 
the decades of 1950 and 1960, American courts have been broadening and
clarifying their practices regarding the admissibility of expert testimony by 


licensing of psychologists, specialty certification by the American Board of 
Examiners in Professional Psychology, standardization and evaluation of train- 


the parties to the suit (82, 86). Psychologists have testified, in both civil and 
criminal cases, regarding many aspects of human behavior. Clinical psycholo- 
gists have been called upon to assess intellectual and emotional conditions of 
children and adults in connection with commitment for mental deficiency or
They have made similar contributions in accident cases where the extent of
damage resulting from brain injury was to be ascertained (37, 86,
95).

Although clinical psychologists are the most likely to be called upon to 
testify, psychologists in other areas of specialization are also being utilized 
increasingly for such purposes. Consumer psychologists, for example, can be of 
service in cases involving trade-name and trademark infringement (see Chap- 
ter 12), misleading advertising, and other questions that pertain basically to 
consumer reactions or to public opinion (40; 77; 82, part I, pp. 258-272). For 
these purposes, relevant data can be gathered from a representative sample of 
the appropriate population and presented to the court with interpretations by 
the psychologist. 

Experimental psychologists also have much to contribute, especially in the 
area of perception and memory. It was with perceptual and memory problems 
that some of the earliest court testimony by psychologists was concerned. A 
more recent illustration is provided by the case cited earlier in this chapter, in 
which a hunter was inadvertently shot by his companions (130). So far, 
however, the utilization of experimental psychologists as expert witnesses has 
been minimal. Mention should also be made of the testimony of social psy- 
chologists in connection with such matters as racial segregation, to be con- 
sidered more fully in the next section (19-21, 66). 

Testifying as expert witnesses is an activity in which most psychologists are 
still quite inexperienced. For this reason, several recent articles have set forth 
the pitfalls that may be encountered in such testimony (74, 87, 106, 114, 
122). When the psychologist testifies for one side in a court trial, he must 
submit to cross-examination, in which he is called upon to defend not only the 
validity of his evidence but also his qualifications as an expert. This is true of 
any expert who testifies within the adversary system of court procedure, what- 
ever his field of specialization. The psychologist, however, may encounter addi- 
tional difficulties, arising from the lack of clarity that still characterizes the 
position of psychology among the learned professions and its relation to 
medicine. 

If forewarned about these problems, the psychologist can of course function 
more effectively in his role as expert witness. In addition, certain impending 
modifications in legal procedure are designed to improve the conditions under 
which expert testimony is given. The Model Expert Testimony Act adopted 
by certain states enhances the objectivity and impartiality of the expert wit- 
ness and removes him from the adversary process (144, p. 15). The act 
provides that each side pay one-half the fee of an expert acceptable to both 
and appointed by the court. Each party, moreover, is entitled to examine the 
expert's report in advance of the trial.

Still another way in which the expert witness may maintain his impartiality 
is to testify as amicus curiae (friend of the court). This procedure has been 
employed especially by groups or associations which are in a position to pro- 
vide specialized information for the guidance of the court. The relevant facts 
and interpretations may be presented either orally, on the witness stand, or by 
filing a written amicus curiae brief. Since this practice has been followed 
particularly in test cases with relatively broad social implications, it will be 
illustrated more fully in the next section.

One recent example of the utilization of an amicus curiae brief, however,
has a special bearing on the future role of psychologists as expert witnesses (54,
117). In a criminal case tried before the United States District Court for the 
District of Columbia, both psychologists and psychiatrists had testified regard- 
ing the mental condition of the defendant. At the conclusion of the trial, the 
judge instructed the jury to disregard the opinions of the psychologists, on the 
grounds that only medically trained experts are competent to give an opinion 
on such a question. Subsequently, in a 2:1 vote, the panel of the appellate 
court which heard the case held, among other things, that the trial judge had
erred in excluding the expert testimony of the psychologists and ordered a 
rehearing en banc (before the full nine-man Court of Appeals). The rehearing 
was to be limited to the issue of “the ruling of the District Court which 
excluded from consideration by the jury the testimony of the psychologists 
concerning the existence and effects of the ‘mental disease or defect’” (54, 
p. 627). 

Because of the important implications of the ultimate decision, the Ameri- 
can Psychological Association filed an amicus curiae brief explaining the train- 
ing and experience of clinical psychologists and submitting that clinical 
psychologists are fully qualified to express expert opinion on such matters. The 
American Psychiatric Association filed a brief in support of the contrary argu- 
ment. The final opinion of the United States Court of Appeals, handed down 
on June 7, 1962, sustained by a 7:2 vote the acceptability of testimony by
properly qualified psychologists in cases involving the determination and mean- 
ing of mental disease or defect as productive of criminal acts (54, p. 626). This 
decision should go far toward clarifying the functions of clinical psychologists 
as expert witnesses. 


PSYCHOLOGY IN THE DEVELOPMENT OF LAWS 


Since by its very nature the law deals with human behavior, it appears obvious 
that psychology should have much to contribute to the formulation, revision,
and interpretation of laws. Both among jurists themselves and among members 
of such professions as psychology, sociology, and psychiatry, there has been a 
steadily mounting discontent with the obsolete and prescientific concepts of 
human nature upon which many laws and legal practices are based. 

Among the most notorious examples is the "McNaghten rule" for establishing
legal sanity and criminal responsibility (15, ch. 17; 129; 138, ch. 7).
Stemming from a decision handed down in the case of one Daniel McNaghten 
tried in a British court in 1843, this rule uses as a test of insanity the defend- 
ant's intellectual inability to differentiate between right and wrong. For over 
a hundred years, the McNaghten rule has been criticized as unrealistic by 
physicians, psychiatrists, and (more recently) psychologists. Actually this rule 
reflects the layman's misconception of the nature of insanity, rather than the 
accumulated scientific knowledge about the behavior of mentally disturbed 
persons. An insane person often knows that his act is wrong and forbidden by 
law; nevertheless he may commit it as a result of his mental disorder (129,
p. 359).

Although the McNaghten rule is still widely applied in American courts, a
few states have begun to replace it with more realistic practices. The trend of 
these innovations is in the direction of giving the expert witness more freedom 
in reporting on the defendant's mental condition and its relationship to his
criminal act. Rather than forcing the professional expert to use concepts that 
fall outside his field and limiting him to a single “test of insanity,” these 
practices permit him to utilize all the resources currently provided by his 
science. 

There are many other areas of legal practice in which available psychologi- 
cal knowledge can make a substantial contribution. Among them may be 
mentioned laws pertaining to drug addiction, alcoholism, and sexual offenses 
(15, ch. 19; 32, chs. 12-13; 138, chs. 15-17). Other examples can be found 
in laws concerned with child development and educational matters (66) and 
laws dealing with civil rights and intergroup relations (111). 

Granting that psychologists, along with members of other professions, ought 
to participate in the development of laws, we may next inquire through what 
channels they may do so. One approach is to assist the courts in their testing 
and interpretation of laws. As reported in the preceding section, a psycholo- 
gist may submit relevant information, either as amicus curiae or as an expert 
witness for one of the parties to the suit. There has been an increasing tend- 
ency for courts to accept behavioral science data for consideration. One of the 
first instances of the extensive use of psychological data in a legal brief con- 
cerned the effects of bilingualism upon the educational development and 
adjustment of children (66). The case dealt with a 1943 statute prohibiting 
the teaching of foreign languages (i.e., other than English) to Hawaiian
schoolchildren below the age of ten or before completion of the fourth grade. 
A survey of research on the intellectual and emotional effects of bilingualism 
led to the conclusion that bilingualism as such is not a handicap. Detrimental 
effects are attributable to improper ways of introducing the second language, 
poor teaching practices, and other concomitant circumstances that need not be 
associated with bilingualism. These findings were utilized by the lawyers in 
preparing an amicus curiae brief. 

In a more recent series of cases dealing with racial discrimination in transpor- 
tation, schooling, and other settings, social psychologists collaborated with other 
social scientists and with lawyers in the presentation of relevant data (ro- 
21, 66). Psychological research bearing upon problems of racial segregation in- 
cludes such varied topics as the test performance of different racial groups, the 
effect of cultural factors upon intellectual and personality development, the 
nature and origins of intergroup attitudes, the role of various factors in attitude 
change, and the effects of segregation upon the self concept and general 
psychological development of the individual (2, 20, 66). Particularly in cases 
dealing with educational segregation, psychologists have on several occasions 
testified as expert witnesses in state and Federal district courts. These cases 
culminated in the momentous decision of May 17, 1954, when the Supreme 
Court of the United States ruled that state laws requiring or permitting racial 
segregation in public education are unconstitutional. In support of its
conclusion that “segregated educational facilities are inherently unequal,” the
Court cited materials that had been submitted by psychologists in an Appendix
to the Appellants’ Briefs (21, pp. 227-235).

Another major way in which psychologists can assist in the development of
laws is by serving as consultants to special legislative committees concerned 
with the revision of laws. Both in America and elsewhere, such committees 
have been drawing more and more upon the advisory services of scientifically 
trained specialists, including psychologists (15, ch. 19; 74). In 1960, for
example, the Interim Committee of the Oregon State Legislature on Revision of
the Criminal Code invited representatives of the Oregon Psychological Asso- 
ciation to attend one of its meetings and express their views on questions deal- 
ing with the McNaghten rule, the classification and treatment of prisoners 
(especially sex criminals), the advisability of pre-sentence investigations as aids
to judges, and the role of psychology in determining whether or not a person 
is mentally ill (74). It might be added that, because of their concern with laws 
affecting the practice of psychology, state psychological associations have per- 
force learned a good deal about legislative machinery. These experiences 
should in turn help psychologists in making an effective contribution to the 
development of laws in general. 

Psychologists may also contribute by collaborating in the research and teach- 
ing functions of law schools. As an example, mention may be made of the 
promising program on Law and the Behavioral Sciences, initiated in the mid- 
1950s at the University of Pennsylvania Law School. Besides involving the 
participation of a psychologist in the teaching of law students, this program 
led to several joint reports and publications. The project has so far concen- 
trated on summarizing and interpreting psychological knowledge pertaining 
to three areas in the law: evidence, criminal law, and family law. One publica- 
tion, for example, provides a survey and critical evaluation of psychological 
studies relevant to certain aspects of censorship laws (17). 


PSYCHOLOGY IN NATIONAL AND 
INTERNATIONAL AFFAIRS 


Scope of Participation. The preamble to the UNESCO Constitution 
declares that “since wars begin in the minds of men it is in the minds of men 
that the defenses of peace must be constructed” (147, p. 712). The conditions 
that precipitate wars and promote peace are undoubtedly numerous and 
complex. It would be a gross oversimplification to single out the psychological 
factors in the effort to explain and improve international relations. Neverthe- 
less such factors do play a part, and their investigation should contribute to- 
ward an understanding of international problems. Psychologists may use their 
specialized techniques to study selected aspects of these problems, or they may 
collaborate with experts in political science, international law, and other 
relevant specialties in carrying out more comprehensive projects. 

Although research oriented specifically toward international problems is of 
relatively recent origin, social psychologists have for some time recognized 
international relations as one of the many types of group relations falling 
within their special sphere of interest. Since 1936, the Society for the
Psychological Study of Social Issues (SPSSI), a division of the American
Psychological Association, has had a special committee concerned with problems of
war and peace. In 1945, the same society sponsored the preparation of a 
volume devoted to the discussion of these problems (102). Because of the 
mounting international tensions and the threat of nuclear warfare, an increas- 
ing number of psychologists from different specialties are now becoming con- 
cerned with these problems. 


3 Personal communication from Dr. Julius Wishner, Dec. 10, 1962. 
By the late 1950s, the participation of psychologists in this area had
attained substantial proportions, with a broader base and a more systematic
approach than had characterized it heretofore (118). A number of universities, 
both in this country and abroad, established interdisciplinary institutes for 
research on group relations and conflict resolution (105, pp. 89-90; 118, pp. 
101-102). In 1957, The Journal of Conflict Resolution was founded by 
representatives of many social sciences, including psychologists. Colleges began 
to offer courses in the psychology of international relations (see, e.g., 25). The 
Journal of Social Issues, official organ of SPSSI, devoted an issue to “research 
approaches to the study of war and peace” (65) and another to “psychology 
and policy in a nuclear age” (120). Attempts to collate the rapidly growing 
literature in the field were made in the June, 1962, SPSSI Newsletter (28) 
and in a later book prepared under the auspices of this society (64). Several 
symposia in this area were held at meetings of the APA and at international 
congresses of psychology (105). 

The spreading interest of psychologists in problems of international rela- 
tions is illustrated by the establishment in 1960 of an APA Committee on 
Psychology in National and International Affairs and the appointment in 1962 
of a full-time executive secretary of this committee, attached to the central 
office of the association (1). This committee has an active program under way, 
involving not only the coordination and facilitation of research by individual 
psychologists but also the establishment of contacts with government agencies 
and United Nations officials. 

The types of contributions that psychologists can make in the area of inter- 
national relations can be grouped roughly under four headings: (1) theoretical 
analysis of policy issues for their psychological components; (2) surveying and 
communicating available psychological findings on relevant topics; (3) re- 
search on problems pertaining more directly to national and international 
policies; and (4) application of current psychological knowledge to action 
programs (118; 119; 120, pp. 79-84).

Psychological Analysis of Policy Issues. Before the psychologist can 
make any contribution to international relations, he needs to analyze the 
issues for their psychological components and identify problems that are 
amenable to research. Such analyses are a necessary first step in any research
program. Effective research cannot be done on anything as broad or abstract 
as war and peace. Not only must questions be formulated in specific terms, 
but the psychological aspects of the problems must also be separated from 
those falling in other areas. 

Apart from facilitating research, such psychological analyses may serve other 
functions. An example is the identification of hidden assumptions about 
human nature that may be questionable or contrary to the trend of empirical 
data (127). Through teaching in political science departments, through non- 
technical publications, and through personal consultations with policy makers, 
psychologists can engender a fuller awareness of the psychological implica- 
tions of different courses of action. They can also encourage the substitution 
of empirically established facts about human behavior for unconfirmed
assumptions and popular stereotypes.

In recent years, psychologists have published an increasing number of such
theoretical analyses of international problems. Examples of the topics thus 
analyzed include: methods for resolving intergroup conflicts (14, 27, 62),
psychological mechanisms underlying “cold-war mentality” (109, 110), and the
potential psychological effects of a civilian defense program centered around 
fallout shelters (140). Such analyses can serve to clarify issues and to suggest 
questions for consideration. But it should be noted that they are better suited 
to raising questions than to answering them. In the field of international 
relations, there may have been too much “theorizing about theory,” on the 
part of psychologists as well as other social scientists (126). 

Surveys of Available Research Findings. A second way in which psy- 
chologists may contribute is by summarizing and integrating the research 
literature on relevant problems and making it available to policy makers. An 
example is the survey of research on tensions affecting international under- 
standing, prepared by Klineberg (68) as part of a UNESCO project on this 
topic. In it are covered studies on national differences in personality, existing 
social stereotypes about such differences, the development and modification 
of attitudes, and factors making for aggression in interpersonal and intergroup 
behavior. A more intensive coverage of research pertaining to aggression is 
provided in more recent surveys devoted entirely to this topic (9, 90). Still
another example is to be found in a symposium on human problems in the 
utilization of fallout shelters (5). Each participant in this symposium discussed 
implications of the available research on such related problems as submarine 
habitability, polar isolation, internment, isolated radar bases, and English 
World War II bomb shelters, as well as experimental studies of sensory 
deprivation (see Chapter 8). 

Research Approaches to International Problems. The most direct con- 
tribution that psychologists can make is through research specifically designed 
within the framework of international problems. Both basic and applied re- 
search is needed on a wide variety of problems (62, 63, 69, 115). An example 
of applied research is the empirical evaluation of programs designed to im- 
prove international relations, such as student exchange, economic aid, techni- 
cal assistance and education, international cultural programs, and the dissemi- 
nation of information. So far little systematic information is available on the 
effectiveness of such programs or on the specific conditions and techniques that 
influence outcome, although a substantial beginning has been made in research 
with exchange students (22).

Research having both applied and basic implications is illustrated by field
studies of intergroup hostility and conflict resolution, utilizing genuine but 
limited conflict situations. Intranational and factional strifes can be investi- 
gated through longitudinal field studies employing direct observation of events 
as well as interviews with representative samples. Examples of suitable situa- 
tions include labor strikes, minority group conflicts, and intergroup clashes in 
countries that comprise distinct cultural subgroups (63, 69, 101, 115).

Cross-cultural attitude studies represent a third type of relevant research
(59, 63, 69). Through interviews, questionnaires, and testing techniques,
psychologists have investigated the images that persons have of their own nation
and of other nations. Other studies have been concerned with the factors 
affecting national stereotypes and the conditions under which intergroup at- 
titudes change. In one series of studies, psychologists and other social scientists 
in seven countries of western Europe collaborated in cross-cultural research on 
the response to threat (59, 121). Utilizing both opinion-survey and experimental
techniques, the investigators followed the same procedures in each
country in order to obtain directly comparable data. 

From a different angle, field studies of human behavior in disaster situations 
also provide pertinent information (18, 104). Although concerned largely 
with such peacetime disasters as tornadoes, hurricanes, explosions, fires, and 
floods, this research has a direct bearing on the planning and implementation 
of civilian defense programs. A few experimental studies have also been con- 
ducted with subjects inhabiting simulated shelters for various periods of time 
(see 104). Of particular interest are three studies of responses to unanticipated
air raid warnings which turned out to be false alarms (88). In each of the 
three localities, random samples of adults were interviewed to learn what the 
individual had done and felt upon hearing the warning siren. A comparison 
of responses in the three situations suggested certain conditions that increase 
the effectiveness of such warning systems. 

A direct experimental approach to problems of intergroup frictions and con- 
flict resolution is illustrated by a series of studies conducted by Sherif and his 
associates (see 91; 124, chs. 6 and 9). In these experiments, boys attending a 
specially organized summer camp underwent controlled experiences designed 
first to produce and later to eliminate intergroup hostility. The investigation 
was concerned with the conditions leading to the development of intergroup 
conflict; the effects of such conflicts on behavior, attitudes, and group stereo- 
types; and the relative effectiveness of different methods for restoring inter- 
group harmony. The most successful technique for overcoming hostility proved 
to be the introduction of a significant common goal (such as obtaining food 
when the food truck had broken down) that required the cooperative efforts 
of both groups. Participation in activities involving such “superordinate goals” 
tended to reduce intergroup tensions and eliminate unfavorable group stereo- 
types. The findings of such experiments are of course limited to the type of 
subjects and situations investigated. This kind of research needs to be repeated 
with many different groups varying in experiential backgrounds and personal 
characteristics, as well as with a wider variety of techniques for the reduction 
of intergroup tensions.

As a final example, we may consider simulation and gaming, which are 
being widely applied to the study of international relations (26, 29-31, 36, 49,
50, 71, 112). Diplomatic or political games are similar to the management
games developed for use in management training programs (see Chapter 5). 
Both were antedated by military games. Game theory is a branch of decision 
theory in which there are two or more decision makers with typically con- 
flicting interests. By means of mathematical models, rational solutions for 
such problems can be derived from the given properties of each situation. This 
approach has been used in the effort to develop a theory of political negotia- 
tions applicable to the type of decisions made in the Congress, the Department 
of State, the United Nations Security Council, and similar contexts (36, 71). 
It has also been applied to more specific problems, such as arms races and 
military deterrence (97). Of particular relevance is a series of coordinated 
research projects on psychological factors in strategic deterrence, conducted 
under the auspices of the Behavioral Sciences Group of the U.S. Naval
Ordnance Test Station (53).

Simulation, it will be recalled, has been employed both in training (see
Chapter 5) and in research on man-machine systems (see Chapter 9). It involves
essentially the presentation of a more or less realistic situation in a somewhat
simplified and condensed form. Real subjects make the decisions on the
basis of given facts, and the outcomes of these decisions are checked periodi- 
cally. Simulation differs from the completely realistic situations illustrated by 
the previously cited Sherif experiment in that the subjects in simulation studies 
know they are playing roles. Typically, however, they become quite ego- 
involved in the game, a condition that is encouraged by the use of monetary 
and other kinds of rewards. 

In a number of recent studies, simulation has been applied within the frame- 
work of game theory. In a continuing research program, Deutsch and his 
associates (26, 29, 30) have employed simulation in two-person games to study 
the development of trust and suspicion, behavior in interpersonal bargaining 
situations, and the influence of such factors as motivational orientation, the 
presence of threat, and different kinds of communication upon outcome. 
Typically, these experiments have employed “non-zero-sum games,” in which 
both sides may gain or both may lose as a result of their combined decisions. 
Figure 129 illustrates the findings of one such experiment, in which the 
effects of threat and communication were explored. The “threat” consisted of 
the availability of a gate that could be closed by one participant, thereby pre- 
venting the other participant from using a road. It will be noted that the joint 
gain, or payoff, was highest under the condition of no threat and lowest under 
the condition of bilateral threat. Under the latter condition, moreover,
communication had no significant effect.
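
The notion of a non-zero-sum game can be made concrete with a minimal sketch. The payoff table below is purely hypothetical: its numbers, the choice labels "cooperate" and "compete," and the function names are illustrative assumptions and are not taken from the experiment described above. The sketch shows only that the two players' payoffs need not cancel, so that both may gain or both may lose, and that a choice which is best for each player taken singly can yield the lowest joint payoff.

```python
# Hypothetical payoff table for a two-person game: (payoff to A, payoff to B).
# All values are illustrative assumptions, not experimental data.
PAYOFFS = {
    ("cooperate", "cooperate"): (3, 3),
    ("cooperate", "compete"): (-5, 4),
    ("compete", "cooperate"): (4, -5),
    ("compete", "compete"): (-3, -3),
}


def joint_payoff(choice_a, choice_b):
    """Sum of both players' payoffs for one pair of choices."""
    a, b = PAYOFFS[(choice_a, choice_b)]
    return a + b


def is_zero_sum(payoffs):
    """True only if one player's gain always equals the other's loss."""
    return all(a + b == 0 for a, b in payoffs.values())


def best_reply_for_a(payoffs, choice_b):
    """Choice that maximizes A's own payoff when B's choice is fixed."""
    options = sorted({a_choice for a_choice, _ in payoffs})
    return max(options, key=lambda c: payoffs[(c, choice_b)][0])


if __name__ == "__main__":
    for pair in PAYOFFS:
        print(pair, "joint payoff:", joint_payoff(*pair))
    print("zero-sum:", is_zero_sum(PAYOFFS))
```

In this made-up table, competing is the better reply for A whatever B does, yet mutual competition gives the smallest joint payoff. Conflicts of this general kind between individual and joint gain are what make such games useful for studying trust, threat, and bargaining.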

Guetzkow (49, 50) describes the development and application of a political 
game in which each of several nations is represented by two or three persons
playing specified roles. Working from given data about each nation, the sub- 
jects make a series of decisions regarding internal and external matters. Among 
the actions open to subjects are economic development, research programs, 
disarmament plans, foreign loans, alliances, and wars. Communications be- 
tween nations are either written or tape-recorded, thus permitting later analy- 
ses. Known as the Inter-nation Simulation (INS), this situation has been 
employed for both teaching and research. 

A methodologically sophisticated research project utilizing one form of 
Guetzkow's INS is reported by Driver (31). Its object was to investigate the 
effects of situational stress and of individual personality characteristics on the 
subjects’ perception of the simulated nations. A total of 357 Middle Western 
high school students of both sexes participated in the different runs of the INS. 
Each run involved seven simulated nations, each manned by three subjects 
performing specified roles for a four-day period. The runs which eventuated in 
simulated wars were classified as providing intense stress. Peacetime runs were 
subdivided into the moderately stressful and the mildly stressful. The com- 
plexity of subjects’ concepts of the simulated nations was evaluated in terms 
of differentiation, integration, and other attributes of the conceptual structure. 
It was pointed out that the more complex concepts permit a fuller and more 
balanced consideration of all relevant variables and hence facilitate effective 
decision making. As hypothesized, stress was found to have a curvilinear re- 
lation to the complexity of such concepts, the most complex concepts being 
found with intermediate degrees of stress. An independent measure of the 
generalized “cognitive complexity” of individual subjects was also significantly 
correlated with the complexity of their concepts of the simulated nations. 

Action Proposals. While many promising beginnings have been made in 
psychological research on problems of international relations, it is apparent 
that there is still a wide gap between such research and the solution of practi- 
cal problems. Nevertheless, some psychologists, as well as other social scien- 
tists, have proposed action programs in the effort to apply what knowledge is 
available (see, e.g., 108, 120, 146). Although such recommendations are not 
based wholly upon experimentally verified facts, it is felt that the psychologist, 
as an expert on human behavior, has an obligation to contribute what he can 
to problems of such immediate urgency. Even in the absence of directly 
relevant data, it is argued, the informed opinion of a technically trained
specialist on matters falling within his area of competence is bound to be more
nearly correct than the opinions of laymen on the same matters.

When functioning in the capacity of an informed citizen, the psychologist 
must exercise special caution to avoid possible pitfalls (55, 119). First, he must 
bear in mind that the solution of complex international problems requires the 
collaboration of specialists in many fields. Within psychology itself, he must 
recognize the limits of his own competence. He must also carefully differentiate 
between opinions (however well informed) and facts, and he must make this
distinction clear in communicating with others. Above all, it is important to 
separate personal beliefs and preferences from one's contribution as a scientist. 
To promote personal ideas under the guise of science would be both un- 
ethical and self-defeating. It is by retaining his scientific objectivity, his aware- 
ness of factors that may distort perceptions, and his research orientation toward 
human problems that the psychologist can make his most distinctive contribution
as a psychologist. This is true in the field of international relations, as it
is in every other field of applied psychology. 


SUMMARY 


Although the practice of law is characteristically resistant to change, there is 
a growing recognition that psychology has much to offer to this field. The
earliest applications of psychology to law were concerned with testimony and 
other courtroom procedures. Many of the findings of general experimental 
psychology are applicable to the evaluation of testimony. In addition, the 
conditions affecting the accuracy of observation and report have been
investigated through Aussage tests and other controlled procedures. More recently,
a limited amount of research has been conducted on the behavior of judges
and juries. A major area of legal psychology is concerned with lie detection. 
The principal techniques explored for this purpose are free association and 
physiological indices of emotion, as illustrated by changes in respiration, blood 
pressure, and galvanic skin response. Psychologists have also given some at- 
tention to the nature and effects of certain traditional methods of stressful 
interrogation.

Together with members of several other professions, psychologists have 
investigated the characteristics of delinquents and criminals in the effort to 
identify the causes of antisocial behavior. Although the conditions that lead
to crime are manifold, research findings have given increasing prominence to 
emotional and motivational factors, stemming at least in part from early child- 
hood experiences. In the treatment of delinquents and criminals, psychological 
services are needed at every stage. But the number of psychologists employed 
in correctional services still falls far short of estimated requirements. The work 
of psychologists in court clinics, prisons, training schools for juvenile delin- 
quents, and other penocorrectional institutions falls chiefly within the areas of 
clinical and counseling psychology. In all these settings, the emphasis is 
primarily on diagnostic functions, although increasing attention is being given 
to therapy and rehabilitation. 

Since midcentury, psychologists have been called upon increasingly to testify 
as expert witnesses in court trials. Although clinical psychologists have most 
often functioned in this capacity, contributions have also been made by
psychologists in other specialties, such as experimental, consumer, and social
psychology. Psychologists may also aid in the development, revision, and 
interpretation of laws by submitting relevant research findings in court testi- 
mony, by serving as consultants to legislative committees, and by collaborating 
in law school teaching and research. Empirically established facts about human 
behavior are of basic importance in many types of laws, such as those pertain- 
ing to insanity, drug addiction, alcoholism, sexual offenses, child development 
and education, civil rights, and minority groups. 

At a still broader level, psychologists are beginning to make contributions 
to national and international affairs. While recognizing that the prevention 
of wars and the solution of other complex international problems require the
collaborative efforts of specialists from many fields, psychologists can use their
specialized knowledge, skills, and points of view to advance understanding
of such problems. Their contributions include both the utilization of currently
available psychological knowledge and the performance of specially
designed research on many facets of international problems. 
APPENDIX A
DIVISIONS OF THE AMERICAN 
PSYCHOLOGICAL ASSOCIATION* 


1. Division of General Psychology
2. Division on the Teaching of Psychology
3. Division of Experimental Psychology
5. Division of Evaluation and Measurement
6. Division of Physiological and Comparative Psychology
7. Division on Developmental Psychology
8. Division of Personality and Social Psychology
9. The Society for the Psychological Study of Social Issues, a Division of the APA
10. Division of Psychology and the Arts
12. Division of Clinical Psychology
13. Division of Consulting Psychology
14. Division of Industrial Psychology
15. Division of Educational Psychology
16. Division of School Psychologists
17. Division of Counseling Psychology
18. Division of Psychologists in Public Service
19. Division of Military Psychology
20. Division on Maturity and Old Age
21. The Society of Engineering Psychologists, a Division of the APA
22. Division on Psychological Aspects of Disability
23. Division of Consumer Psychology
24. Division of Philosophical Psychology
25. Division for the Experimental Analysis of Behavior
26. Division of the History of Psychology


Total membership in 1966: 24,473 


* Owing to reorganizations and other changes made since the introduction of a divisional 
structure in the APA, there are currently no divisions numbered 4 or 11. 
APPENDIX B
MAJOR DIRECTORIES OF PSYCHOLOGISTS 


Directories are published by the following organizations: 


American Psychological Association, Inc. 
1200 Seventeenth Street, N.W. 
Washington, D.C. 20036 


Directory lists all members of the American Psychological Association
alphabetically, geographically, and by divisional affiliation. It provides personal data such
as current address, education, occupational history, present occupation, fields of 
special interest, certification, diplomate status, etc. 


American Board of Examiners in Professional Psychology, Inc.
Dr. Noble Kelley, Secretary
Southern Illinois University
Carbondale, Illinois


Directory lists all diplomates in clinical, counseling, and industrial psychology, 
together with personal data. 


State Directories 


By 1965, the following states had enacted legislation regarding the certification or 
licensing of psychologists: 


Alabama          Michigan 
Arizona          Minnesota 
Arkansas         Nevada 
California       New Hampshire 
Colorado         New Mexico 
Connecticut      New York 
Delaware         Oklahoma 
Florida          Oregon 
Georgia          Tennessee 
Idaho            Utah 
Illinois         Virginia 
Kentucky         Washington 
Louisiana        Wyoming 
Maine 
Maryland 

Canadian Provinces of Alberta, Quebec, Ontario, and Saskatchewan 


In such states, lists of certified psychologists may be obtained by writing to the 
appropriate agency within the state government. Information may be obtained 
from the local public library or from any university psychology department within 
the state. 


Many state psychological associations also publish lists of their members, with in- 
formation regarding their qualifications. 


APPENDIX C 
ETHICAL STANDARDS OF PSYCHOLOGISTS* 


The psychologist believes in the dignity and worth of the individual human be- 
ing. He is committed to increasing man's understanding of himself and others. 
While pursuing this endeavor, he protects the welfare of any person who may seek 
his service or of any subject, human or animal, that may be the object of his study. 
He does not use his professional position or relationships, nor does he knowingly 
permit his own services to be used by others, for purposes inconsistent with these 
values. While demanding for himself freedom of inquiry and communication, he ac- 
cepts the responsibility this freedom confers: for competence where he claims it, 
for objectivity in the report of his findings, and for consideration of the best in- 
terests of his colleagues and of society. 


Specific Principles 


Principle 1. Responsibility. The psychologist,† committed to increasing man's 
understanding of man, places high value on objectivity and integrity, and maintains 
the highest standards in the services he offers. 


a. As a scientist, the psychologist believes that society will be best served when he 
investigates where his judgment indicates investigation is needed; he plans his 
research in such a way as to minimize the possibility that his findings will be mis- 
leading; and he publishes full reports of his work, never discarding without ex- 
planation data which may modify the interpretation of results. 

b. As a teacher, the psychologist recognizes his primary obligation to help others 
acquire knowledge and skill, and to maintain high standards of scholarship. 

c. As a practitioner, the psychologist knows that he bears a heavy social responsi- 
bility because his work may touch intimately the lives of others. 


Principle 2. Competence. The maintenance of high standards of professional 
competence is a responsibility shared by all psychologists, in the interest of the public 
and of the profession as a whole. 


a. Psychologists discourage the practice of psychology by unqualified persons and 
assist the public in identifying psychologists competent to give dependable pro- 
fessional service. When a psychologist or a person identifying himself as a psy- 
chologist violates ethical standards, psychologists who know firsthand of such 
activities attempt to rectify the situation. When such a situation cannot be dealt 
with informally, it is called to the attention of the appropriate local, state, or 
national committee on professional ethics, standards, and practices. 

b. The psychologist recognizes the boundaries of his competence and the limita- 
tions of his techniques and does not offer services or use techniques that fail to 
meet professional standards established in particular fields. The psychologist who 
engages in practice assists his client in obtaining professional help for all im- 
portant aspects of his problem that fall outside the boundaries of his own com- 
petence. This principle requires, for example, that provision be made for the 
diagnosis and treatment of relevant medical problems and for referral to or 
consultation with other specialists. 

c. The psychologist in clinical work recognizes that his effectiveness depends in 
good part upon his ability to maintain sound interpersonal relations, that tempo- 
rary or more enduring aberrations in his own personality may interfere with this 
ability or distort his appraisals of others. Therefore he refrains from undertaking 
any activity in which his personal problems are likely to result in inferior pro- 
fessional services or harm to a client; or, if he is already engaged in such an 
activity when he becomes aware of his personal problems, he seeks competent 
professional assistance to determine whether he should continue or terminate his 
services to his client. 


* From Amer. Psychologist, 1963, 18, 56-60. 
† A student of psychology who assumes the role of psychologist shall be considered a 
psychologist for the purpose of this code of ethics. 


Principle 3. Moral and Legal Standards. The psychologist in the practice of his 
profession shows sensible regard for the social codes and moral expectations of the 
community in which he works, recognizing that violations of accepted moral and 
legal standards on his part may involve his clients, students, or colleagues in 
damaging personal conflicts, and impugn his own name and the reputation of his 
profession. 


Principle 4. Misrepresentation. The psychologist avoids misrepresentation of 
his own professional qualifications, affiliations, and purposes, and those of the in- 
stitutions and organizations with which he is associated. 


a. A psychologist does not claim either directly or by implication professional 
qualifications that differ from his actual qualifications, nor does he misrepresent 
his affiliation with any institution, organization, or individual, nor lead others 
to assume he has affiliations that he does not have. The psychologist is respon- 
sible for correcting others who misrepresent his professional qualifications or 
affiliations. 

b. The psychologist does not misrepresent an institution or organization with which 
he is affiliated by ascribing to it characteristics that it does not have. 

c. A psychologist does not use his affiliation with the American Psychological As- 
sociation or its Divisions for purposes that are not consonant with the stated 
purposes of the Association. 

d. A psychologist does not associate himself with or permit his name to be used in 
connection with any services or products in such a way as to misrepresent them, 
the degree of his responsibility for them, or the nature of his affiliation. 


Principle 5. Public Statements. Modesty, scientific caution, and due regard for 
the limits of present knowledge characterize all statements of psychologists who 
supply information to the public, either directly or indirectly. 


a. Psychologists who interpret the science of psychology or the services of psy- 
chologists to clients or to the general public have an obligation to report fairly 
and accurately. Exaggeration, sensationalism, superficiality, and other kinds of 
misrepresentation are avoided. 

b. When information about psychological procedures and techniques is given, care 
is taken to indicate that they should be used only by persons adequately 
trained in their use. 

c. A psychologist who engages in radio or television activities does not participate 
in commercial announcements recommending purchase or use of a product. 


Principle 6. Confidentiality. Safeguarding information about an individual that 
has been obtained by the psychologist in the course of his teaching, practice, or 
investigation is a primary obligation of the psychologist. Such information is not 
communicated to others unless certain important conditions are met. 




a. Information received in confidence is revealed only after most careful delibera- 
tion and when there is clear and imminent danger to an individual or to society, 
and then only to appropriate professional workers or public authorities. 

b. Information obtained in clinical or consulting relationships, or evaluative data 
concerning children, students, employees, and others are discussed only for pro- 
fessional purposes and only with persons clearly concerned with the case. 
Written and oral reports should present only data germane to the purposes of 
the evaluation; every effort should be made to avoid undue invasion of privacy. 

c. Clinical and other case materials are used in classroom teaching and writing 
only when the identity of the persons involved is adequately disguised. 

d. The confidentiality of professional communications about individuals is main- 
tained. Only when the originator and other persons involved give their express 
permission is a confidential professional communication shown to the individual 
concerned. The psychologist is responsible for informing the client of the limits 
of the confidentiality. 

e. Only after explicit permission has been granted is the identity of research sub- 
jects published. When data have been published without permission for identi- 
fication, the psychologist assumes responsibility for adequately disguising their 
sources. 

f. The psychologist makes provision for the maintenance of confidentiality in the 
preservation and ultimate disposition of confidential records. 


Principle 7. Client Welfare. The psychologist respects the integrity and protects 
the welfare of the person or group with whom he is working. 


a. The psychologist in industry, education, and other situations in which conflicts 
of interest may arise among various parties, as between management and labor, 
or between the client and employer of the psychologist, defines for himself the 
nature and direction of his loyalties and responsibilities and keeps all parties 
concerned informed of these commitments. 

b. When there is a conflict among professional workers, the psychologist is con- 
cerned primarily with the welfare of any client involved and only secondarily 
with the interest of his own professional group. 

c. The psychologist attempts to terminate a clinical or consulting relationship when 
it is reasonably clear to the psychologist that the client is not benefiting from it. 

d. The psychologist who asks that an individual reveal personal information in the 
course of interviewing, testing, or evaluation, or who allows such information to 
be divulged to him, does so only after making certain that the responsible person 
is fully aware of the purposes of the interview, testing, or evaluation and of the 
ways in which the information may be used. 

e. In cases involving referral, the responsibility of the psychologist for the welfare 
of the client continues until this responsibility is assumed by the professional 
person to whom the client is referred or until the relationship with the psy- 
chologist making the referral has been terminated by mutual agreement. In 
situations where referral, consultation, or other changes in the conditions of the 
treatment are indicated and the client refuses referral, the psychologist carefully 
weighs the possible harm to the client, to himself, and to his profession that 
might ensue from continuing the relationship. 

f. The psychologist who requires the taking of psychological tests for didactic, 
classification, or research purposes protects the examinees by insuring that the 
tests and test results are used in a professional manner. 

g. When potentially disturbing subject matter is presented to students, it is dis- 
cussed objectively, and efforts are made to handle constructively any difficulties 
that arise. 

h. Care must be taken to insure an appropriate setting for clinical work to protect 
both client and psychologist from actual or imputed harm and the profession 
from censure. 


Principle 8. Client Relationship. The psychologist informs his prospective 
client of the important aspects of the potential relationship that might affect the 
client's decision to enter the relationship. 


a. Aspects of the relationship likely to affect the client's decision include the re- 
cording of an interview, the use of interview material for training purposes, and 
observation of an interview by other persons. 

b. When the client is not competent to evaluate the situation (as in the case of a 
child), the person responsible for the client is informed of the circumstances 
which may influence the relationship. 

c. The psychologist does not normally enter into a professional relationship with 
members of his own family, intimate friends, close associates, or others whose 
welfare might be jeopardized by such a dual relationship. 


Principle 9. Impersonal Services. Psychological services for the purpose of 
diagnosis, treatment, or personalized advice are provided only in the context of a 
professional relationship, and are not given by means of public lectures or demon- 
strations, newspaper or magazine articles, radio or television programs, mail, or 
similar media. 


a. The preparation of personnel reports and recommendations based on test data 
secured solely by mail is unethical unless such appraisals are an integral part of 
a continuing client relationship with a company, as a result of which the con- 
sulting psychologist has intimate knowledge of the client's personnel situation 
and can be assured thereby that his written appraisals will be adequate to the 
purpose and will be properly interpreted by the client. These reports must not 
be embellished with such detailed analyses of the subject's personality traits as 
would be appropriate only after intensive interviews with the subject. The re- 
ports must not make specific recommendations as to employment or placement of 
the subject which go beyond the psychologist's knowledge of the job require- 
ments of the company. The reports must not purport to eliminate the company's 
need to carry on such other regular employment or personnel practices as ap- 
praisal of the work history, checking of references, and past performance in the 
company. 


Principle 10. Announcement of Services. A psychologist adheres to professional 
rather than commercial standards in making known his availability for professional 
services. 


a. A psychologist does not directly solicit clients for individual diagnosis or therapy. 

b. Individual listings in telephone directories are limited to name, highest relevant 
degree, certification status, address, and telephone number. They may also in- 
clude identification in a few words of the psychologist’s major areas of prac- 
tice; for example, child therapy, personnel selection, industrial psychology. 
Agency listings are equally modest. 

c. Announcements of individual private practice are limited to a simple statement 
of the name, highest relevant degree, certification or diplomate status, address, 
telephone number, office hours, and a brief explanation of the types of services 
rendered. Announcements of agencies may list names of staff members with 
their qualifications. They conform in other particulars with the same standards 
as individual announcements, making certain that the true nature of the or- 
ganization is apparent. 

d. A psychologist or agency announcing nonclinical professional services may use 
brochures that are descriptive of services rendered but not evaluative. They may 
be sent to professional persons, schools, business firms, government agencies, and 
other similar organizations. 

e. The use in a brochure of “testimonials from satisfied users” is unacceptable. The 
offer of a free trial of services is unacceptable if it operates to misrepresent in 
any way the nature or the efficacy of the services rendered by the psychologist. 
Claims that a psychologist has unique skills or unique devices not available to 
others in the profession are made only if the special efficacy of these unique 
skills or devices has been demonstrated by scientifically acceptable evidence. 

f. The psychologist must not encourage (nor, within his power, even allow) a 
client to have exaggerated ideas as to the efficacy of services rendered. Claims 
made to clients about the efficacy of his services must not go beyond those which 
the psychologist would be willing to subject to professional scrutiny through pub- 
lishing his results and his claims in a professional journal. 


Principle 11. Interprofessional Relations. A psychologist acts with integrity in 
regard to colleagues in psychology and in other professions. 


a. A psychologist does not normally offer professional services to a person receiv- 
ing psychological assistance from another professional worker except by agree- 
ment with the other worker or after the termination of the client's relationship 
with the other professional worker. 

b. The welfare of clients and colleagues requires that psychologists in joint practice 
or corporate activities make an orderly and explicit arrangement regarding the 
conditions of their association and its possible termination. Psychologists who 
serve as employers of other psychologists have an obligation to make similar ap- 
propriate arrangements. 


Principle 12. Remuneration. Financial arrangements in professional practice 
are in accord with professional standards that safeguard the best interest of the 
client and the profession. 


a. In establishing rates for professional services, the psychologist considers carefully 
both the ability of the client to meet the financial burden and the charges made 
by other professional persons engaged in comparable work. He is willing to 
contribute a portion of his services to work for which he receives little or no 
financial return. 

b. No commission or rebate or any other form of remuneration is given or received 
for referral of clients for professional services. 

c. The psychologist in clinical or counseling practice does not use his relationships 
with clients to promote, for personal gain or the profit of an agency, commercial 
enterprises of any kind. 

d. A psychologist does not accept a private fee or any other form of remuneration 
for professional work with a person who is entitled to his services through an 
institution or agency. The policies of a particular agency may make explicit 
provision for private work with its clients by members of its staff, and in such 
instances the client must be fully apprised of all policies affecting him. 


Principle 13. Test Security. Psychological tests and other assessment devices, 
the value of which depends in part on the naivete of the subject, are not reproduced 
or described in popular publications in ways that might invalidate the techniques. 
Access to such devices is limited to persons with professional interests who will 
safeguard their use. 


a. Sample items made up to resemble those of tests being discussed may be repro- 
duced in popular articles and elsewhere, but scorable tests and actual test items 
are not reproduced except in professional publications. 

b. The psychologist is responsible for the control of psychological tests and other 
devices and procedures used for instruction when their value might be damaged 
by revealing to the general public their specific contents or underlying principles. 


Principle 14. Test Interpretation. Test scores, like test materials, are released 
only to persons who are qualified to interpret and use them properly. 


a. Materials for reporting test scores to parents, or which are designed for self- 
appraisal purposes in schools, social agencies, or industry are closely supervised 
by qualified psychologists or counselors with provisions for referring and coun- 
seling individuals when needed. 

b. Test results or other assessment data used for evaluation or classification are com- 
municated to employers, relatives, or other appropriate persons in such a manner 
as to guard against misinterpretation or misuse. In the usual case, an interpreta- 
tion of the test result rather than the score is communicated. 

c. When test results are communicated directly to parents and students, they are 
accompanied by adequate interpretive aids or advice. 


Principle 15. Test Publication. Psychological tests are offered for commercial 
publication only to publishers who present their tests in a professional way and 
distribute them only to qualified users. 


a. A test manual, technical handbook, or other suitable report on the test is pro- 
vided which describes the method of constructing and standardizing the test, and 
summarizes the validation research. 

b. The populations for which the test has been developed and the purposes for 
which it is recommended are stated in the manual. Limitations upon the test's 
dependability, and aspects of its validity on which research is lacking or incom- 
plete, are clearly stated. In particular, the manual contains a warning regarding 
interpretations likely to be made which have not yet been substantiated by 
research. 

c. The catalog and manual indicate the training or professional qualifications re- 
quired for sound interpretation of the test. 

d. The test manual and supporting documents take into account the principles 
enunciated in the Technical Recommendations for Psychological Tests and 
Diagnostic Techniques. 


e. Test advertisements are factual and descriptive rather than emotional and per- 
suasive. 


Principle 16. Research Precautions. The psychologist assumes obligations for 
the welfare of his research subjects, both animal and human. 


a. Only when a problem is of scientific significance and it is not practicable to in- 
vestigate it in any other way is the psychologist justified in exposing research 
subjects, whether children or adults, to physical or emotional stress as part of an 
investigation. 

b. When a reasonable possibility of injurious aftereffects exists, research is con- 
ducted only when the subjects or their responsible agents are fully informed of 
this possibility and agree to participate nevertheless. 

c. The psychologist seriously considers the possibility of harmful aftereffects and 
avoids them, or removes them as soon as permitted by the design of the ex- 
periment. 

d. A psychologist using animals in research adheres to the provisions of the Rules 
Regarding Animals, drawn up by the Committee on Precautions and Standards 
in Animal Experimentation and adopted by the American Psychological Associa- 
tion. 


Principle 17. Publication Credit. Credit is assigned to those who have con- 
tributed to a publication, in proportion to their contribution, and only to these. 


a. Major contributions of a professional character, made by several persons to a 
common project, are recognized by joint authorship. The experimenter or author 
who has made the principal contribution to a publication is identified as the first 
listed. 

b. Minor contributions of a professional character, extensive clerical or similar 
nonprofessional assistance, and other minor contributions are acknowledged in 
footnotes or in an introductory statement. 

c. Acknowledgment through specific citations is made for unpublished as well as 
published material that has directly influenced the research or writing. 

d. A psychologist who compiles and edits for publication the contributions of others 
publishes the symposium or report under the title of the committee or sym- 
posium, with his own name appearing as chairman or editor among those of the 
other contributors or committee members. 


Principle 18. Responsibility toward Organization. A psychologist respects the 
rights and reputation of the institute or organization with which he is associated. 


a. Materials prepared by a psychologist as a part of his regular work under spe- 
cific direction of his organization are the property of that organization. Such 
materials are released for use or publication by a psychologist in accordance 
with policies of authorization, assignment of credit, and related matters which 
have been established by his organization. 

b. Other material resulting incidentally from activity supported by any agency, and 
for which the psychologist rightly assumes individual responsibility, is published 
with disclaimer for any responsibility on the part of the supporting agency. 


Principle 19. Promotional Activities. The psychologist associated with the de- 
velopment or promotion of psychological devices, books, or other products offered for 
commercial sale is responsible for ensuring that such devices, books, or products are 
presented in a professional and factual way. 


a. Claims regarding performance, benefits, or results are supported by scientifically 
acceptable evidence. 

b. The psychologist does not use professional journals for the commercial exploita- 
tion of psychological products, and the psychologist-editor guards against such 
misuse. 

c. The psychologist with a financial interest in the sale or use of a psychological 
product is sensitive to possible conflict of interest in his promotion of such prod- 
ucts and avoids compromise of his professional responsibilities and objectives. 


APPENDIX D 
AN EXAMPLE OF A JOB DESCRIPTION* 


Dry Cleaner (Hand) 


Job Summary 
Removes dirt and spots from drapes, window shades, lamp shades, leather goods, 
and fabrics too delicate to be machine dry-cleaned, by scrubbing them with a 
solution of soap and cleaning solvent. 


Work Performed 

1. Examines article to be cleaned: Observes texture and condition of fabric, and the 
nature of the spot, to decide how intensive a scrubbing action the material 
will withstand without injury; reads note attached to article to learn extent of 
cleaning desired by customer. 

2. Cleans article: Spreads and holds article flat on scrubbing table so that scrubbing 
action will not tear or injure fabric; dips cup into solution of soap and cleaning 
solvent and pours it over spot; moistens brush of appropriate stiffness in solu- 
tion and scrubs coarse fabrics with a vigorous motion in all directions; scrubs 
fine fabrics with a gentle motion in only one direction to avoid injury to 
material; scrapes hardened spots with a bone spatula; rinses article in a trough 
containing clean solvent to rinse out the dirt; may manipulate valves to drain 
and refill rinsing trough, and give article a final rinse in the clean solution. 
(The same kind of cleaning solvent is used for all fabrics.) 

3. May assist FURNITURE WASHER to move large pieces to and from drying room. 


Equipment 
Scrubbing table: A waist-high table with a wooden, stone, or metal top on which 
articles are scrubbed and rinsed to clean them. Usually the table is provided 
with a low rim around its edges and slopes gradually to a trough so that liquids 
will drain into the trough instead of spilling onto the floor. 


Relation to Other Jobs 
Promotion to: DRY CLEANER. 
Transfer from and to: FURNITURE WASHER. 


Job Combination: The duties of this job may be included in those of FURNITURE 
WASHER or DRY-CLEANER HELPER. 


Specialized Qualifications 
Many establishments hire inexperienced workers and train them in the methods 
characteristic of the establishment. 


Ability to ascertain the strength of the various fabrics to prevent damage by ex- 
cessive scrubbing, is desirable. 


* From U.S. Employment Service, Job Analysis and Information Section. Job descrip- 
tions for the cleaning, dyeing, and pressing industry. Washington: Govt. Printing Office, 
1935, p. OF. 


APPENDIX E 
TEST PUBLISHERS 


Below are the names and addresses of some of the larger American publishers and 
distributors of psychological tests. Catalogues of current tests can be obtained from 
these publishers on request. Manuals and specimen sets of tests can be purchased 
by qualified users. For names and addresses of other test publishers, see the Pub- 
lisher's Directory in the latest Mental Measurements Yearbook. 


California Test Bureau, Del Monte Research Park, Monterey, California 

Consulting Psychologists Press, Inc., 577 College Avenue, Palo Alto, California 

Educational Test Bureau, 720 Washington Avenue, S.E., Minneapolis 14, Min- 
nesota 

Educational Testing Service, Cooperative Test Division, 20 Nassau Street, Prince- 
ton, New Jersey 

Harcourt, Brace & World, Inc., Test Department, 757 Third Ave., New York 17, 
New York 

Houghton Mifflin Company, 2 Park Street, Boston 7, Massachusetts 

Institute for Personality and Ability Testing, 1602 Coronado Drive, Champaign, 
Illinois 

Psychological Corporation, 304 East 45th Street, New York 17, New York 

Psychological Test Specialists, Box 1441, Missoula, Montana 

Psychometric Affiliates, Box 1625, Chicago 90, Illinois 

Public School Publishing Company, 345 Calhoun Street, Cincinnati 19, Ohio 

Science Research Associates, Inc., 259 East Erie Street, Chicago 11, Illinois 

Sheridan Supply Company, P.O. Box 837, Beverly Hills, California 

C. H. Stoelting Company, 424 North Homan Avenue, Chicago 24, Illinois 

Western Psychological Services, Box 775, Beverly Hills, California 
of outcome, in clinical psychology, 345-347 
Preferences, for brands, 273-274 
for foods, 275-277 
for radio and television programs, 262-264 
Prenatal factors, in mental disorders, 412-413 
Pre-positioning of tools, 223 
Prevention, in mental health movement, 
387-388 
Preventive medicine, and consumer psychol- 
ogy, 251-252 
psychological techniques in, 522 
Printed advertisements, methods for testing 
effectiveness, 252-260 
Printing types, in advertisements, 292-295 
Prison psychologist, 562-563 
Problem solving, research on, 496-498 
Problem-solving interview, 109, 124 
Process research on psychotherapy, 400-404 
Product design, 221, 274-276 
Product image, 303-308 
Product personality, 303-308 
Product testing, 273-277 
Prognosis, in clinical psychology, 345-347 
Program Analyzer, 262-265 
Programmed learning, 121-122, 505-510 
branching, 506-508 
intrinsic programming, 508 
linear programming, 504-505 
response shaping, 506-507 
use of computers in, 510 
use with delinquents, 509 
vanishing, 506-507 
Progressive Matrices, 46-47, 355 
Projective techniques, 68-70 
in clinical psychology, 359-363 
evaluation of, 362-363, 396-400 
in motivation research, 297-300 
in study of employee attitudes, 137 
use with amputees, 538-539 
Psychiatrist, definition, 13 
Psychoanalysis, 382-383 
and psychosomatic medicine, 533 
repression in, 382-383, 416 
transference in, 383, 385, 416 
Psychoanalyst, definition, 13 
Psychodrama, 386 
Psychological tests, in clinical diagnosis, 348, 
352-366 
in counseling, 433-444 
in education, 491, 493-497 
Psychological tests, publishers of, 593 
sources of information on, 43-44 
Psychomotor performance, research on, 178- 
181 
Psychopharmacology, 373-376, 404, 408- 
409, 413, 524-526 
Psychoses, 342-343 
causes, 409-415 
in different cultures, 414-415 
Psychosomatic medicine, 532-537 
Psychosurgery, 372 
Psychotherapy, 379-387 
changes in course of, 400-404 
client-therapist relation, 379, 383, 384, 
386, 402-403 
outcome of, 404-409 
process research on, 400-404 
relation to client variables, 381-382, 406- 
407 
role of learning in, 379-381 
and socioeconomic level, 381-382 
therapist variables, 402-403 
varieties of, 382-387 
Psychotomimetic drugs, 373, 413 
Public health (see Mental health; Preven- 
tion; Preventive medicine) 
Public opinion polls, 251 
Pulse, 261 
Purdue Eye Camera, 255-256, 320 
Purdue Hand Precision Test, 196 
Purdue Pegboard, 54, 56 
Purdue Vocational Tests, 63 
Purkinje phenomenon, 548 
Pursuit tracking, 232 


Q sort, 363-364 
Questionnaires, in consumer research, 271-272
in employee attitude surveys, 136
in personality testing, 66-68, 357-359, 438, 441
in radio and television research, 261
Questions, in consumer research, 271
direct, 300-301
effect on accuracy of legal testimony, 549-550
Quickening, in man-machine systems, 240 


Racial segregation cases, psychologists’ contributions to, 567
Radio-range signals, in aerial navigation, 231
Radio research, 260-265
Random sample, 269
Rapport, 349
Rare events, in clinical judgment, 351
distribution of, 185
Rating, 76-88
factors affecting accuracy, 84-88
techniques, 77-83
Rating scales, 78-80
Reactional biography, 94-95
Readability, 331-335 
evaluation, 334-335 
formulas, 332-333
relation to readership, 334 
Readership surveys, 253-255, 311-319 
factor analysis in, 319 
mock, 326-327 
and readability, 334 
Readiness, in education, 490-491, 493-495
Reading ease, of reading matter, 332-333
Reality testing, 441
Recognition tests, for advertisements, 253-254
Recovery, Inc., 386
Reduced cues, in advertising, 323
References, in personnel selection, 92-94
Rehabilitation, of accident-prone drivers, 187 
psychologists' roles in, 521-522 
Rehabilitation counseling, 429-430 
Reliability, of tests, 25-27 
Reminder ads, 289, 291 
Repetition, and attention value of ads, 313-315, 320
with variation, 323-324 
Repression, in psychoanalysis, 382-383, 416 
Respiration calorimeter, 165 
Response styles, 272 
Rest pauses, 170-171, 173-174
Retirement, research on, 476 
Rhythms, physiological, 172 
Role Construct Repertory Test, 365-366 
Role playing, in clinical psychology, 386 
in industrial training, 125-126 
in motivation research, 297-300 
Rorschach Inkblots, 33, 360-361, 398-400 
Rosenzweig Picture-Frustration Study, 361-362
adaptation for motivation research, 297 
Rotation, effects on human performance, 206 
Rotation of jobs, 155, 169 


Safety (see Accident prevention)
Sales personality, 267-268
Sales tests of advertising effectiveness, 258, 260
in radio and television advertising, 264-265
Sales training, 266, 268
Salesman selection, 65, 266
Salesmanship, 265-268
Sampling error, 30
Sampling problems, in consumer research, 269-271
in evaluating outcomes of therapy, 406-407
Scaled checklists, 79-81 
(See also Attitude scales) 
Schizophrenia, 342 
causes, 409-415
Schizophrenia, and child-rearing practices, 414
cultural factors, 414-415
and learning theory, 416
major research projects on, 409-410
physiological differences from normals, 413
effect of institutionalization on, 413 
process versus reactive, 346 
psychotherapy with, 381 
Schizophrenogenic mother, 414
School psychologists, 483-486 
Scientific method, in applied psychology, 6-10
Seasons, and output of industrial workers, 203
Self-acceptance, 363-364, 385, 401-402
Self concepts, changes during psychotherapy, 380, 385, 401-402
in clinical psychology, 363-366
in occupational psychology, 459-460, 462, 471, 473
Self-report inventories (see Personality inventories)
Selling, 265-268 
Semantic Differential, 306, 324, 364-365 
Semicircular work space, 223-225 
empirical modification of, 225 
Sensitivity training, 125 
Sensory deprivation, 208, 213, 416, 524, 556 
Sentence completion tests, 359-360 
Sequential Tests of Educational Progress, 495-497
Serial movements, analysis of, 180-181 
Servomechanism, 239 
Shift work, 172-173 
Shock therapy, 372-373 
Significance levels, 31 
Significant difference, 30-31 
Simplified typewriter keyboard, 225-227 
Simulation, 119-121, 127, 243-244, 571-573 
Simulators, 119-121, 243-244 
Situational tests, 70-71 
clinical interviews as, 350 
Size, and attention value of ads, 311-313 
in relation to product use, 312-313 
Slogans, 323, 324 
Social desirability variable, 67-68, 271-272 
Social expectancy, and family resemblance, 410-411
Social psychology, in industry, 133-158
in teacher training, 492
Social stereotypes, 11, 85
Social workers, in environmental therapy, 377
in gathering case-history data, 348
Socioeconomic level, and incidence of mental disorders, 415
and medical practice, 520 
and psychotherapy, 381-382 
Sociogram, 137-138 
Sociometry, 82, 137, 153-155, 209 
Somatopsychology, 537-541 
Somatotherapy, 371-376 
Sonar, 231 
Sorting tests, 355-356 
Space travel, environmental factors in, 205-209
remote control of vehicles, 240 
system simulation, 243-244 
training devices and simulators, 121-122 
Special aptitude tests, 53-58, 435 
Specialization, in industry, 155 
Specificity of tests, 27-28 
Speech intelligibility, 210-211 
Split-run technique, in advertising research, 258-259, 334
Standard deviation, 29 
Standard scores, 29 
Stanford-Binet Intelligence Scale, 352-353 
Stanine, 31 
Starch scores, for advertisements, 253, 311-320
State certification of psychologists, 11, 584
Statistical significance, 30-31
Statistical techniques, for clinical psychology, 395-396
Stereotype accuracy, in personality assessment, 396
Stereotypes, associated with brands of gasoline, 298-299
associated with use of instant foods, 297-298
in operation of controls, 235-236
in personnel appraisal, 85
Stimulus generalization, in development of emotional disorders, 417-418
Stratified sample, 269
Strong Vocational Interest Blank, 65, 266, 438, 440, 461, 470
Student-centered teaching, 503-504
Study of Values, 66, 301
Subliminal perception, 328-331
relation to response indicator, 330-331
Suggestion in advertising, indirect, 291-296
negative, 289 
Supervision, in industry, 116, 134, 141-144 
Supervisory training, 115-116, 123-126 
Survey of Interpersonal Values, 66 
Synthetic validity, 33-34 
System simulation, 243-244 
Systems development, in education, 510-511 
in engineering psychology, 238-244 


Tachistoscope, in advertising research, 255-257, 320-321
(See also VISTA)
Tactile language, 232
Tactual discrimination, in communication, 232
of shapes, 237-238 
Task analysis, in training programs, 109-110
Taxonomy of Educational Objectives, 495
Teacher, as training medium, 503-504
Teacher characteristics, 500-502
Teacher training, psychology in, 487-492
Teaching machines, 121-123, 504-510
Teaching media, 502-511
Telephone inquiries, in radio and television research, 261
Television, in education, 502-503
in industrial training, 106-107
Television research, 260-265
Temperature, 200-201
Test of Diagnostic Skills, 527-528
Testimony, psychological factors in, 548-550
Testing (see Psychological tests)
Thematic Apperception Test, 69, 361, 398-400
adaptations for motivation research, 297
Therapeutic community, 377-378
Therapist responses, effect on client responses, 402-403
Therapist variables, effect on psychotherapy, 402-403
Therapy, with children, role of parents in, 377
in clinical psychology, 371 
environmental, 376-379 
group, 385-386 
play, 386-387 
preventive, 387-390 
psychotherapy, 379-387 
somatic, 371-376 
Therblig, 174-175 
Time-and-motion study, 173-177, 181 
Tomkins-Horn Picture Arrangement Test, 69-70
Tracking, research on, 232-233 
Trade characters, 324-326 
Trade names, and ad recognition, 327 
associations with, 298-299, 303-308 
confusion, 328 
ease of pronouncing, 296 
generic, 327-328
as reduced cues, 323-328 
testing effectiveness of, 324-327 
Trade tests, 62-64 
Trademarks, 303-305, 323-327 
testing effectiveness of, 324-327 
Traffic research, 183-188 
Training in industry, automated, 117-122 
case method, 125, 127 
versus education, 104-105 
evaluation of, 112-116
of interviewers, 97
learning principles in, 111-112
management games, 126-127
media, 106-109
role playing, 125-126
multiple, 126
Training in industry, of salesmen, 266, 268
scope, 104-109
types of, 105-106
Training devices, 117-120, 122
Training schools, psychologists in, 563-564 
Trait acquaintance, 86-87 
Tranquilizers, 373-376, 404, 409 
Transfer of training, 112, 489 
Transference, in psychoanalysis, 383, 385, 416
Tropical fatigue, 204
Truck drivers, research on, 188, 208
Tuberculosis, psychological research on, 538-541
Twins, in research on causes of mental disorders, 411-412
Typewriter keyboard, proposed rearrangement, 225-227


Ulcers, research on, 533-536 
Universal Motion Analyzer, 180-181 


Validity, 28-34 
of clinical tests, 396-400
coefficient, 32
concurrent, 28
of counseling tests, 447-449
of diagnostic indices, in medicine, 531
differential, 38-39, 61-62
illusory, 39-40
predictive, 28
Values, in clinical and counseling psychology, 341
in concepts of mental health, 389 
in counseling, 453, 470 
in evaluation of outcomes of therapy, 406 
in occupational choice, 459, 470 
Ventilation, 200-203 
Vestibule school, 108-109 
Vibratese, 232 
Vigilance, 166, 213 
VISTA (Visual Testing Apparatus), 257, 321-323
Visual efficiency, of ads, 321-323
indices of, 193-196
Visualization, in motivation research, 297-300
Vocabulary, and reading ease, 331-332 
Vocational adjustment, 474 
Vocational counseling, 428, 458-477 
Vocational development, 471-476 
stages of, 473 
Vocational maturity, 474 
Volunteer error, 270 


Ward behavior, measures of, 405
Warming-up period, in work curves, 164, 170
Weather, psychological effects, 203-204
Wechsler Adult Intelligence Scale, 353-354
deterioration index, 355
Wechsler Intelligence Scale for Children, 353
Weightlessness, 205-206 
Wesman Personnel Classification Test, 45-46
White space, in advertising, 313-314, 322 
Wonderlic Personnel Test, 45 
Work, spatial layout of, 154-155, 223-226, 241-243
Work curve, 163-164
Work week, length of, 171-172
Working hours (see Work week)
Written inquiries, in study of advertising effectiveness, 258, 319-320


You: Today and Tomorrow, 471-472, 475 
Zero gravity, 205-206